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Field of the Invention 

The present invention relates to two novel human genes, each of which encodes 
a polypeptide which is a member of the Interleukin family. More specifically, the 
5 present invention relates to a polynucleotide encoding a novel human polypeptide 
named Interleukin-21, or "IL-2r\ The present invention also relates to a 
polynucleotide encoding a novel human polypeptide named Interleukin-22, or "IL-22". 
This invention also relates to IL-21 and IL-22 polypeptides, as well as vectors, host 
cells, antibodies directed to TL-21 and IL-22 polypeptides, and the recombinant 
10 methods for producing the same. Also provided are diagnostic methods for detecting 
disorders related to the Immune system, and therapeutic methods for treating such 
^ disorders. The invention further relates to screening methods for identifying agonists 

p and antagonists of JL-21 and IL-22 activity. 

09 

M 

y Background of the Invention 

;P 15 Cytokines typically exert their respective biochemical and physiological effects 

by binding to specific receptor molecules. Receptor binding then stimulates specific 
signal transduction pathways (Kishimoto, T., et ai, Cell 76:253-262 (1994)). The 
specific interactions of cytokines with their receptors are often the primary regulators of 



□ 



l0 a wide variety of cellular processes including activation, proliferation, and 

& 20 differentiation (Arai, K. -I, et aU Ann. Rev. Biochem. 59:783-836 (1990); Paul, W. 

09 E. and Seder, R. A., Cell 76:241-251 (1994)). 

Human interleukin (IL)-17, a closely related homolog of the molecules of the 
present invention, was only recently identified. IL-17 is a 155 amino acid polypetide 
which was molecularly cloned from a CD4+ T-cell cDNA library (Yao, Z., et ai, J. 
25 Immunol. 155:5483-5486 (1995)). The IL-17 polypeptide contains an N-terminal 
signal peptide and contains approximately 72% identity at the amino acid level with a 
T-cell trophic herpesvirus saimiri (H VS) gene designated H VS 1 3. High levels of TL- 1 7 
are secreted from CD4-positive primary peripheral blood leukocytes (PBL) upon 
stimulation (Yao, Z., et ai, Immunity 3:81 1-821 (1995)). Treatment of fibroblasts 
30 with IL- 1 7, HVS 13, or another murine homologue, designated CTLA8, activate signal 
transduction pathways and result in the stimulation of the NF-kB transcription factor 
family, the secretion of IL-6, and the costimulation of T-cell proliferation (Yao, Z., et 
ai. Immunity 3:81 1-821 (1995)). 

An HVS13-Fc fusion protein was used to isolate a murine TL-17 receptor 
35 molecule which does not appear to belong to any of the previously described cytokine 



receptor families (Yao, Z., et al, Immunity 3:81 1-821 (1995)). The murine IL-17 
receplor (mIL-17R) is predicted to encode a type I transmembrane protein of 864 amino 
acids with an apparent molecular mass of 97.8 kDa. mIL-1 7R is predicted to possess 
an N-terminal signal peptide with a cleavage site between alanine-31 and serine-32. 
The molecule also contains a 291 amino acid extracellular domain, a 21 amino acid 
transmembrane domain, and a 521 amino acid cytoplasmic tail. A soluble recombinant 
IL-17R molecule consisting of 323 amino acids of the extracellular domain of IL-17R 
fused to the Fc portion of human immunoglobulin IgGl was able to significantly inhibit 
IL-17-induced 1L-6 production by murine NIH-3T3 cells {supra). 

Interestingly, the expression of the IL- 17 gene is highly restricted. It is 
typically observed primarily in activated T-lymphocyte memory cells (Broxmeyer, H. J. 
Exp. Med. 183:2411-2415 (1996); Fossiez, F., et al, J. Exp. Med. 183:2593-2603 
(1996)). Conversely, the IL-17 receptor appears to be expressed in a large number of 
cells and tissues (Rouvier, E., et al., J. Immunol 150:5445-5456 (1993); Yao, Z., et 
al, J. Immunol 155:5483-5486 (1995)). It remains to be seen, however, if IL-17 
itself can play an autocrine role in the expression of TL-17. IL-17 has been implicated 
as a causitive agent in the expression of IL-6, IL-8, G-CSF, Prostaglandin E (PGEj), 
and intracellular adhesion molecule (ICAM)-l (Fossiez, F., supra; Yao, Z., et al. f 
Immunity 3:81 1-821 (1995)). Each of these molecules possesses highly relevent and 
potentially therapeutically valuable properties. For instance, IL-6 is involved in the 
regulation of hematopoietic stem and progenitor cell growth and expansion (Ikebuchi, 
K., etal, Proc. Natl. Acad. Scl USA 84:9035-9039 (1987); Gentile, P. and 
Broxmeyer, H. E. Ann. N.Y. Acad. Scl USA 628:74-83 (1991)). EL-8 exhibits a 
myelosuppressive activity for stem cells and immature subsets of myeloid progenitors 
(Broxmeyer, H. E., et al, Ann. Hematol 71:235-246 (1995); Daly, T. J., et al, J. 
Biol Chem. 270:23282-23292 (1995)). G-CSF acts both early and late to activate and 
stimulate hematopoiesis in general, and more specifically on neutrophil hematopoiesis, 
while PGEj enhances erythropoiesis, suppresses lymphopoiesis and myelopoiesis in 
general, and strongly suppresses monocytopoiesis (Broxmeyer, H. E. Amer. J. Ped. 
Hematol/Oncol 14:22-30 (1992); Broxmeyer, H. E. and Williams, D. E. CRC Crit. 
Rev. Oncol/HematoL 8: 173-226 (1988)). 

Thus, there is a need for polypeptides that function as immunoregulatory 
molecules and, thereby, modulate the transfer of an extracellular signal ultimately to the 
nucleus of the cell, since disturbances of such regulation may be involved in disorders 
relating to cellular activation, hemostasis, angiogenesis, tumor metastasis, cellular 
migration and ovulation, as well as neurogenesis. Therefore, there is a need for 




identification and characterization of such human polypeptides which can play a role in 
detecting, preventing, ameliorating or correcting such disorders. 

Summary of the Invention 

The present invention relates to novel polynucleotides and the encoded 
5 polypeptides of IL-21 and IL-22. Moreover, the present invention relates to vectors, 
host cells, antibodies, and recombinant methods for producing the polypeptides and 
polynucleotides. Also provided are diagnostic methods for detecting disorders related 
to the polypeptides, and therapeutic methods for treating such disorders. The invention 
further relates to screening methods for identifying binding partners of IL-21 and 
10 IL-22. 

m 

q Brief Description of the Drawings 

□ Figure 1 shows the nucleotide sequence (SEQ ID NO: 1) and the deduced amino 

^ acid sequence (SEQ ID NO:2) of IL-21. The locations of conserved Domains I-IV (see 

y below) are underlined and labeled as such. 

j£ 15 Figure 2 shows the nucleotide sequence (SEQ ID NO:3) and the deduced amino 

s acid sequence (SEQ ID NO:4) of IL-22. The locations of conserved Domains I-IV (see 

B below) are underlined and labeled as such. The locations of two potential N-linked 

glycosylation sites are identified by a bolded asparagirie symbol (N) accompanied by a 
tfj bolded pound sign (#) located above the initial nucleotide of the codon encoding the 

20 corresponding asparagine: 



Figure 3 shows the regions of identity between the amino acid sequence of the 
IL-21 and TL-22 proteins, the amino acid sequence of IL-20 (disclosed in copending 
U.S. Provisional Application Serial No. 60/060,140; filed September 26, 1997; SEQ 
ID NO: 8), and the translation products of human Interleukin- 17 (ATCC Accession No. 

25 U32659; SEQ ID NO:5), mouse Tnterleukin- 17 (ATCC Accession No. U43088; SEQ 
ID NO:6), and viral Interleukin- 17 (ATCC Accession No. X64346; SEQ ID NO:7), as 
determined by the Meg Align component of the computer program DNA*Star 
(DNASTAR, Inc.) using the default parameters. 

Figure 4 shows an analysis of the IL-21 amino acid sequence. Alpha, beta, turn 

30 and coil regions; hydrophilicity and hydrophobicity; amphipathic regions; flexible 

regions; antigenic index and surface probability are shown. In the "Antigenic Index" or 
"Jameson-Wolf graph, the positive peaks indicate locations of the highly antigenic 
regions of the IL-21 protein, that is, regions from which epitope-bearing peptides of the 
invention can be determined. The domains defined by these graphs are contemplated by 

35 the present invention. 



Figure 5 shows an analysis of the IL-22 amino acid sequence. Alpha, beta, turn 
and coil regions; hydrophilicity and hydrophobicity; amphipathic regions; flexible 
regions; antigenic index and surface probability are shown. Tn the "Antigenic Index" or 
"Jameson-Wolf graph, the positive peaks indicate locations of the highly antigenic 
regions of the TL-22 protein, that is, regions from which epitope-bearing peptides of the 
invention can be determined. The domains defined by these graphs are contemplated by 
the present invention. 

Detailed Description 
Definitions 

The following definitions are provided to facilitate understanding of certain 
terms used throughout this specification. 

In the present invention, "isolated" refers to material removed from its original 
environment (e.g., the natural environment if it is naturally occurring), and thus is 
altered "by the hand of man" from its natural state. For example, an isolated 
polynucleotide could be part of a vector or a composition of matter, or could be 
contained within a cell, and still be "isolated" because that vector, composition of 
matter, or particular cell is not the original environment of the polynucleotide. 

In the present invention, a "secreted" IL-21 or IL-22 protein refers to a protein 
capable of being directed to the ER, secretory vesicles, or the extracellular space as a 
result of a signal sequence, as well as an TL-21 or IL-22 protein released into the 
extracellular space without necessarily containing a signal sequence. If the IL-21 or 
IL-22 secreted protein is released into the extracellular space, the IL-21 or IL-22 
secreted protein can undergo extracellular processing to produce a "mature" IL-2 1 or 
IL-22 protein. Release into the extracellular space can occur by many mechanisms, 
including exocytosis and proteolytic cleavage. 

As used herein , an IL-21 or IL-22 "polynucleotide" refers to a molecule having 
a nucleic acid sequence contained in SEQ ID NO: 1 or in SEQ ID NO:3, respectively, or 
the cDNA contained within the respective clones deposited with the ATCC. For 
example, the IL-21 or IL-22 polynucleotide can contain the nucleotide sequence of the 
full-length cDNA sequence, including the 5' and 3* untranslated sequences, the coding 
region, with or without the signal sequence, the secreted protein coding region, as well 
as fragments, epitopes, domains, and variants of the nucleic acid sequence. Moreover, 
. as used herein, an IL-21 or TL-22 "polypeptide" refers to a molecule having the 
translated amino acid sequence generated from the polynucleotide as broadly defined. 

In the present invention, the full-length IL-21 sequence identified as SEQ ID 
NO: 1 was generated by overlapping sequences contained in multiple clones (con tig 



analysis). A representative clone containing all or most of the sequence for SEQ TD 
NO:l (designated HTGED19) was deposited with the American Type Culture 
Collection ("ATCC") on March 5, 1998, and was given the ATCC Deposit Number 
209666. In addition, and also in the present invention, the full-length IL-22 sequence 
identified as SEQ ID NO:3 was also generated by overlapping sequences contained in 
multiple clones (contig analysis). Likewise, a representative clone containing all or 
most of the sequence for SEQ ID NO:3 (designated HFPBX96) was also deposited 
with the ATCC on March 5, 1998, and was given the ATCC Deposit Number 209665. 
The ATCC is located at 10801 University Blvd., Manasas, VA 201 10-2209 , USA. 
The ATCC deposit was made pursuant to the terms of the Budapest Treaty on the 
international recognition of the deposit of microorganisms for purposes of patent 
procedure. 

An IL-21 "polynucleotide" also includes those polynucleotides capable of 
hybridizing, under stringent hybridization conditions, to sequences contained in SEQ 
TD NO: 1 , the complement thereof, or the cDNA within the deposited clone. Further, 
An IL-22 "polynucleotide" also includes those polynucleotides capable of hybridizing, 
under stringent hybridization conditions, to sequences contained in SEQ ID NO:3, the 
complement thereof, or the cDNA within the deposited clone. "Stringent hybridization 
conditions" refers to an overnight incubation at 42"C in a solution comprising 50% 
formamide, 5x SSC (750 mM NaCI, 75 mM sodium citrate), 50 mM sodium phosphate 
(pH 7.6), 5x Denhardt's solution, 10% dextran sulfate, and 20 ug/ml denatured, 
sheared salmon sperm DN A, followed by washing the filters in 0. lx SSC at about 
65'C. 

Also contemplated are nucleic acid molecules that hybridize to the IL-21 and the 
IL-22 polynucleotides at lower stringency hybridization conditions. Changes in the 
stringency of hybridization and signal detection are primarily accomplished through the 
manipulation of formamide concentration (lower percentages of formamide result in 
lowered stringency); salt conditions, or temperature. For example, lower stringency 
conditions include an overnight incubation at 37 *C in a solution comprising 6X SSPE 
(20X SSPE = 3M NaCI; 0.2M NaH 2 P0 4 ; 0.02M EDTA, pH 7.4), 0.5% SDS, 30% 
formamide, 100 ug/ml salmon sperm blocking DNA; followed by washes at 50°C with 
1XSSPE, 0.1% SDS. In addition, to achieve even lower stringency, washes 
performed following stringent hybridization can be done at higher salt concentrations 
(e.g. 5X SSC). 

Note that variations in the above conditions may be accomplished through the 
inclusion and/or substitution of alternate blocking reagents used to suppress 
background in hybridization experiments. Typical blocking reagents include 



Denhardt's reagent, BLOTTO, heparin, denatured salmon sperm DNA, and 
commercially available proprietary formulations. The inclusion of specific blocking 
reagents may require modification of the hybridization conditions described above, due 
to problems with compatibility. 

Of course, a polynucleotide which hybridizes only to polyA+ sequences (such 
as any 3* terminal polyA+ tract of a cDNA shown in the sequence listing), or to a 
complementary stretch of T (or U) residues, would not be included in the definition of 
"polynucleotide," since such a polynucleotide would hybridize to any nucleic acid 
molecule containing a polyA+ stretch or the complement thereof (e.g., practically any 
double-stranded cDNA clone). 

The IL-21 and IL-22 polynucleotides can be composed of any 
polyribonucleotide or polydeoxyribonucleotide, which may be unmodified RNA or 
DNA or modified RNA or DNA. For example, the IL-21 and IL-22 polynucleotides 
can be composed of single- and double-stranded DNA, DNA that is a mixture of single- 
and double-stranded regions, single- and double-stranded RNA, and RNA that is 
mixture of single- and double-stranded regions, hybrid molecules comprising DNA and 
RNA that may be single-stranded or, more typically, double-stranded or a mixture of 
single- and double-stranded regions. In addition, the IL-21 polynucleotides can be 
composed of triple-stranded regions comprising RNA or DNA or both RNA and DNA. 
IL-21 polynucleotides may also contain one or more modified bases or DNA or RNA 
backbones modified for stability or for other reasons. "Modified" bases include, for 
example, tritylated bases and unusual bases such as inosine. A variety of modifications 
can be made to DNA and RNA; thus, "polynucleotide" embraces chemically, 
enzymatically, or metabolically modified forms. 

IL-21 and IL-22 polypeptides can be composed of amino acids joined to each 
other by peptide bonds or modified peptide bonds, i.e., peptide isosteres, and may 
contain amino acids other than the 20 gene-encoded amino acids. The TL-21 and IL-22 
polypeptides may be modified by either natural processes, such as posttranslational 
processing, or by chemical modification techniques which are well known in the art. 
Such modifications are well described in basic texts and in more detailed monographs, 
as well as in a voluminous research literature. Modifications can occur anywhere in the 
IL-21 and IL-22 polypeptides, including the peptide backbone, the amino acid 
side-chains and the amino or carboxyl termini. It will be appreciated that the same type 
of modification may be present in the same or varying degrees at several sites in a given 
IL-2 1 or IL-22 polypeptide. Also, a given IL-21 or IL-22 polypeptide may contain 
many types of modifications. IL-21 or IL-22 polypeptides may be branched, for 
example, as a result of ubiquitinauon, and they may be cyclic, with or without 



branching. Cyclic, branched, and branched cyclic IL-21 and IL-22 polypeptides may 
result from posttranslation natural processes or may be made by synthetic methods. 
Modifications include acetylation, acylation, ADP-ribosylation, amidation, covalent 
attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a 
nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, 
covalent attachment of phosphotidylinositol, cross-linking, cyclization, disulfide bond . 
formation, demethylation, formation of covalent cross-links, formation of cysteine, 
formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI 
anchor formation, hydroxylation, iodination, methylation, myristoylalion, oxidation, 
pegylation, proteolytic processing, phosphorylation, prenylation, racemization, 
selenoylation, sulfation, transfer-RNA mediated addition of amino acids to proteins 
such as arginylation, and ubiquitination. (See, for instance, PROTEINS - 
STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton, W. 
H. Freeman and Company, New York (1993); POSTTRANSLATION AL 
COVALENT MODIFICATION OF PROTEINS, B. C. Johnson, Ed., Academic 
Press, New York, pgs. 1-12 (1983); Seifter, et aL Meth. Enzymol 182:626-646 
(1990); Rattan, et al, Ann. NY Acad. Set 663:48-62 (1992)). 

"SEQ ID NO:l M refers to an IL-21 polynucleotide sequence while "SEQ ID 
NO:2" refers to an IL-21 polypeptide sequence. Likewise, "SEQ ID NO:3" refers to an 
IL-22 polynucleotide sequence while "SEQ ID NO:4" refers to an IL-22 polypeptide 
sequence. 

An IL-21 polypeptide "having biological activity" refers to polypeptides 
exhibiting activity similar, but not necessarily identical to, an activity of an IL-21 
polypeptide, including mature forms, as measured in a particular biological assay, with 
or without dose-dependency. In addition, an IL-22 polypeptide "having biological 
activity" refers to polypeptides exhibiting activity similar, but not necessarily identical 
to, an activity of an IL-22 polypeptide, including mature forms, as measured in a 
particular biological assay, with or without dose-dependency. In the case where 
dose-dependency does exist, it need not be identical to that of the IL-2 1 or IL-22 
polypeptide, but rather substantially similar to the dose-dependence in a given activity 
as compared to the IL-21 or IL-22 polypeptides (i.e., the candidate polypeptide will 
exhibit greater activity or not more than about 25-fold less and, preferably, not more 
than about tenfold less activity, and most preferably, not more than about three-fold less 
activity relative to the IL-21 polypeptide). 

IL-21 and IL-22 Polynucleotides and Polypeptides 
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Clone HTGED19, encoding IL-21, was isolated from acDNA library derived 
from apoptotic T-cells. This clone contains the entire coding region identified as SEQ 
ID NO:2. The deposited clone contains a cDNA having a total of 705 nucleotides, 
which encodes a partial predicted open reading frame of 87 amino acid residues (see 
Figure 1). The partial open reading frame begins at a point in the complete IL-21 ORF 
such that the "G" in position 1 of SEQ TD NO: 1 is actually in position 3 of a coding 
triplet. As such, the partial predicted IL-21 polypeptide sequence is shown beginning 
in-frame with an alanine residue at position 1 of SEQ ID NO:2. The alanine residue at 
position 1 of SEQ ID NO:2 is encoded by nucleotides 2-4 of the nucleotide sequence 
shown as SEQ ID NO: 1 . The ORF shown as SEQ ID NO:2 ends at a stop codon at 
nucleotide position 263-265 of the nucleotide sequence shown as SEQ ID NO: 1 . The 
predicted molecular weight of the partial TL-21 protein should be about 9,558 Daltons. 

An initial BLAST analysis of the expression of the IL-21 cDNA sequence 
against the HGS EST database has also revealed a highly specific expression of this 
cDNA clone. In such an analysis, the HTGED19 cDNA sequence appears to be found 
only in apoptotic T-cells. Thus, IL-21 appears to be expressed in a highly restricted 
pattern limited to apoptotic T-celts, and, for example, other subpopulatons of 
lymphocytes or other cells in a state of activation or quiescence. 

Clone HFPBX96, encoding IL-22, was isolated from a cDNA library derived 
from epileptic frontal cortex. This clone contains the entire coding region identified as 
SEQ ID NO:4. The deposited clone contains a cDN A having a total of 1 ,642 
nucleotides, which encodes a partial predicted open reading frame of 160 amino acid 
residues (see Figure 2). The partial open reading frame begins at a point in the 
complete IL-22 ORF such that the "G" in position 1 of SEQ TD NO:3 is actually in 
position two of a coding triplet. As such, the partial predicted IL-22 polypeptide 
sequence is shown beginning in- frame with an asparagine residue at position 1 of SEQ 
ID NO:4. The asparagine residue at position 1 of SEQ ID NO:4 is encoded by 
nucleotides 3-5 of the nucleotide sequence shown as SEQ ID NO:3. The ORF shown 
as SEQ ID NO:4 ends at a stop codon at nucleotide position 483-485 of the nucleotide 
sequence shown as SEQ ID NO:3. The predicted molecular weight of the partial IL-22 
protein should be about 17,436 Daltons. 

Using BLAST and MegAlign analyses, SEQ ID NO:2 and SEQ ID NO:4 were 
each found to be highly homologous to several members of the Interleukin family. 
Particularly, SEQ ID NO:2 and SEQ ID NO:4 contain at least four domains 
homologous to the translation products of the human mRNA for Interleukin (IL)-20 
(copending U.S. Provisional Application Serial No. 60/060,140; filed September 26, 
1997; SEQ 3D NO:8), IL- 17 (ATCC Accession No. U32659; SEQ ID NO:5: see also 




9 

Figure 3), the murine mRNA for Interleukin (IL)-I7 (ATCC Accession No. U43088; 
SEQ TD NO:6; see also Figure 3), and the human viral mRNA for Interleukin (IL)-17 
(ATCC Accession No. X64346; SEQ ID NO:7; see also Figure 3). Specifically, the 
molecules of ihe present invention, in particular, SEQ ID NO:2 and SEQ ID NO:4, 
5 share a high degree of sequence identity with EL-20, LL- 17, mJDL-17, and vTL-17 in the 
following conserved domains: (a) a predicted NXDPXRYP domain (where X 
represents any amino acid) located at about amino acids valine-3 to proline- 1 1 of SEQ 
ID NO:2, serine-57 to proline-64 of SEQ ID NO:4, and asparagine-79 to proline-86 of 
the human IL-17 amino acid sequence (SEQ ID NO:5); (b) a predicted CLCXGC 

10 domain (where X represents any amino acid) located at about amino acids cysteine- 19 
to cysteine-24 of SEQ ID NO;2, cysteine-72 to cysteine-77 of SEQ TD NO:4, and 
cysteine-94 to cysteine-99 of the human IL-17 amino acid sequence (SEQ ID NO:5); (c) 
a predicted LVLRRXP domain (where X represents any amino acid) located at about 
amino acids leucine-46 to proline-52 of SEQ ID NO:2, valine-99 to proline-105 of SEQ 

15 ID NO:4, and leucine- 120 to proline- 126 of the human TL-17 amino acid sequence 
(SEQ ID NO:5); and (d) a predicted VXVGCTCV domain (where X represents any 
amino acid) located at about arnino acids valine-75 to valine-82 of SEQ ID NO:2, 
isolcucirie-121 to valine-128 of SEQ ID NO:4, and valine-140 to valine-147 of the 
human IL-17 amino acid sequence (SEQ ID NO:5). These polypeptide fragments of 

20 IL-21 are specifically contemplated in the present invention. Because each of these 
IL-17 and IL-17-like molecules is thought to be important immunoregulatory 
molecules, the homology between these IL-17 and IL-17-like molecules and IL-21 and 
TL-22 suggests that IL-21 and IL-22 may also be important immunoregulatory 
molecules. 

25 Moreover, based on their apparent sequence identities with IL-17 and IL-20 (see 

Figure 3), the full-length IL-21 and IL-22 polypeptides are each likely to have an amino 
terminal secretory signal peptide leader sequence. Since the present invention appears 
to be partial cDNA clones of the IL-21 and IL-22 molecules, it is also contemplated that 
the translation products of the present invention will be caused to enter the cellular 

30 secretory pathway by virtue of being expressed as a fusion proteins comprising several 
different portions of the N-terminus of the IL-20 molecule of copending U.S. 
Provisional Application Serial No. 60/060,140 fused to the known coding sequence of 
the TL-21 or IL-22 molecules of the present invention. Such expression constructs will 
. secrete hybrid IL-20/IL-21 or IL-20/EL-22 molecules from the host cell. The mature 

35 DL-2 1 protein used in these fusion proteins encompasses about amino acids 1-145, 

while the IL-20/21 fusion protein encompasses about the 104 or 1 13 N-terminal amino 
acids of IL-20 encoded in frame with about amino acids I- 145 of IL-21. These 
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polypeptide fragments of IL-21 are specifically contemplated in the present invention. 
In addition, the mature IL-22 protein used to generate these fusion proteins 
encompasses about amino acids 1-1 60, while the 1L-20/22 fusion protein encompasses 
about the 95, 104 or 1 1 3 N-terminal amino acids of IL-20 encoded in frame with about 
amino acids 1-160 of IL-22. These polypeptide fragments of IL-22 are specifically 
contemplated in the present invention. 

The IL-21 and IL-22 nucleotide sequences identified as SEQ ID NO:l and SEQ 
ID NO: 3, respectively, were assembled from partially homologous ("overlapping") 
sequences obtained from the deposited clones. The overlapping sequences were 
assembled into two single contiguous sequences of high redundancy (usually three to 
five overlapping sequences at each nucleotide position), resulting in two final sequences 
identified as SEQ ID NO: 1 and SEQ ID NO:3. 

Therefore, SEQ ID NO: 1 and the translated SEQ ID NO:2, and SEQ ID NO:3 
and the translated SEQ ID NO:4, are sufficiently accurate and otherwise suitable for a 
variety of uses well known in the art and described further below. For instance, SEQ 
ID NO: 1 and SEQ ID NO:3 are useful for designing nucleic acid hybridization probes 
that will detect nucleic acid sequences contained in SEQ ID NO: 1 , SEQ ID NO:3, or the 
cDNA contained in the respective deposited cDNA clones. These probes will also 
hybridize to nucleic acid molecules in biological samples, thereby enabling a variety of 
forensic and diagnostic methods of the invention. Similarly, polypeptides identified 
from SEQ ID NO:2 and SEQ ID NO:4 may be used to generate antibodies which bind 
specifically to IL-21 and IL-22, respectively. 

Nevertheless, DNA sequences generated by sequencing reactions can contain 
sequencing errors. The errors exist as misidentified nucleotides, or as insertions or 
deletions of nucleotides in the generated DNA sequence. The erroneously inserted or 
deleted nucleotides cause frame shifts in the reading frames of the predicted amino acid 
sequence. In these cases, the predicted amino acid sequence diverges from the actual 
amino acid sequence, even though the generated DNA sequence may be greater than 
99.9% identical to the actual DNA sequence (for example, one base insertion or deletion 
in an open reading frame of over 1000 bases). 

Accordingly, for those applications requiring precision in the nucleotide 
sequence or the amino acid sequence, the present invention provides not only the 
generated nucleotide sequence identified as SEQ ID NO:l and the predicted translated 
amino acid sequence identified as SEQ ID NO:2, but also a sample of plasmid DNA 
containing a human cDNA of TL-21 deposited with the ATCC. In addition, the present 
invention also provides not only the generated nucleotide sequence identified as SEQ ID 
NO: 3 and the predicted translated amino acid sequence identified as SEQ ID NO:4, but 
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also a sample of plasmid DNA containing a human cDNA of IL-22 deposited with the 
ATCC. Accordingly, the nucleotide sequence of the deposited IL-21 and IL-22 clones 
can be readily determined by sequencing the deposited clone in accordance with known 
methods. The predicted IL-21 and IL-22 amino acid sequences can then be verified 
5 from such deposits. Moreover, the amino acid sequence of the protein encoded by the 
deposited clone can also be directly determined by peptide sequencing or by expressing 
the protein in a suitable host cell containing the deposited human IL-21 or IL-22 
cDNAs, collecting the protein, and determining its sequence. 

The present invention also relates to the IL-21 gene corresponding to SEQ ID 
1 0 NO: 1 , SEQ ID NO:2, or the deposited clone which encodes IL-2 1 . The present 
invention further relates to the IL-22 gene corresponding to SEQ ID NO:3, SEQ ID 
NO:4, or the deposited clone which encodes IL-22. The IL-21 and IL-22 genes can be 
p isolated in accordance with known methods using the sequence information disclosed 

O herein. Such methods include preparing probes or primers from the disclosed 

{jj 15 sequences and identifying or amplifying the IL-21 and IL-22 genes from appropriate 

sources of genomic material. 
jP Also provided in the present invention are species homologs of IL-21 and 

IL-22. Species homologs may be isolated and identified by making suitable probes or 
9 primers from the sequences provided herein and screening a suitable nucleic acid source 

20 for the desired homolog. 
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$ The IL-21 and IL-22 polypeptides can be prepared in any suitable manner. 

j£j Such polypeptides include isolated naturally occurring polypeptides, recombinantly 

produced polypeptides, synthetically produced polypeptides, or polypeptides produced 
by a combination of these methods. Means for preparing such polypeptides are well 
25 understood in the art. 

The IL-21 and IL-22 polypeptides may be in the form of the secreted protein, 
including the mature form, or may be a part of a larger protein, such as a fusion protein. 
It is often advantageous to include ah additional amino acids which comprise secretory 
or leader sequences, pro-sequences, sequences which aid in purification, such as 
30 multiple histidine residues, or an additional sequence for stability during recombinant 
production. 

IL-21 and IL-22 polypeptides are preferably provided in an isolated form, and 
preferably are substantially purified. A recombinantly produced version of an IL-21 or 
IL-22 polypeptide, including the secreted polypeptide, can be substantially purified by 
35 the one-step method described in the publication by Smith and Johnson (Gene 

67:31-40 (1988)). IL-21 and IL-22 polypeptides also can be purified from natural or 
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recombinant sources using antibodies of the invention raised against the IL-21 and 
IL-22 proteins, respectively, in methods which are well known in the art. 

Polynucleoti de and Polypeptide Variqntff 

"Variant" refers to a polynucleotide or polypeptide differing from the TL-21 and 
IL-22 polynucleotides or polypeptides, but retaining essential properties thereof. 
Generally, variants are overall closely similar, and, in many regions, identical to the 
IL-21 and IL-22 polynucleotide or polypeptide. 

By a polynucleotide having a nucleotide sequence at least, for example, 95% 
"identical" to a reference nucleotide sequence of the present invention, it is intended that 
the nucleotide sequence of the polynucleotide is identical to the reference sequence 
except that the polynucleotide sequence may include up to five point mutations per each 
100 nucleotides of the reference nucleotide sequence encoding the IL-21 or IL-22 
polypeptides. In other words, to obtain a polynucleotide having a nucleotide sequence 
at least 95% identical to a reference nucleotide sequence, up to 5% of the nucleotides in 
the reference sequence may be inserted, deleted or substituted with another nucleotide. 
The query sequence may be an entire sequence shown of SEQ ID NO: 1 or SEQ ID 
NO:3, the ORF (open reading frame) of either IL-2 1 or IL-22, or any fragement 
specified as described herein. 

As a practical matter, whether any particular nucleic acid molecule or 
polypeptide is at least 90%, 95%, 96%, 97%, 98% or 99% identical to (or 10%, 5%, 
4%, 3%, 2% or 1% different from) a nucleotide sequence of the presence invention can 
be determined conventionally using known computer programs. A preferred method 
for determing the best overall match between a query sequence (a sequence of the 
present invention) and a subject sequence, also referred to as a global sequence 
alignment, can be determined using the FASTDB computer program based on the 
algorithm of Brutlag and colleagues (Comp. App. BioscL 6:237-245 (1990)). In a 
sequence alignment the query and subject sequences are both DNA sequences. An 
RNA sequence can be compared by converting (uridine residues (U) to thymidine 
residues (T). The result of said global sequence alignment is in percent identity. 
Preferred parameters used in a FASTDB alignment of DNA sequences to calculate 
percent identiy are: Matrix=Unitary, k-tuple=4, Mismatch Penal ty=l, Joining 
Penalty=30, Randomization Group Length=0, Cutoff Score=l , Gap Penalty=5, Gap 
Size Penalty 0.05, Window Size=500 or the length of the subject nucleotide sequence, 
whichever is shorter. 

If the subject sequence is shorter than the query sequence because of 5' or 3* 
deletions, but not because of internal deletions, a manual correction must be made to the 
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results. This is becuase the FASTDB algorithm docs not account for 5' and 3' 
truncations of the subject sequence when calculating percent identity. For subject 
sequences truncated at the 5' or 3* ends, relative to the the query sequence, the percent 
identity is corrected by calculating the number of bases of the query sequence that are 5' 
5 and 3' of the subject sequence, which are not matched/aligned, as a percent of the total 
bases of the query sequence. Whether a nucleotide is matched/aligned is determined by 
results of the FASTDB sequence alignment. This percentage is then subtracted from 
the percent identity, calculated by the above FASTDB program using the specified 
parameters, to arrive at a final percent identity score. This corrected score is what is 
10 used for the purposes of the present invention. Only bases outside the 5' and 3" bases 
of the subject sequence, as displayed by the FASTDB alignment, which are not 
matched/aligned with the query sequence, are calculated for the purposes of manually 
^ adjusting the percent identity score. 

q For example, a 90 base subject sequence is aligned to a 1 00 base query 

1 5 sequence to determine percent identity. The deletions occur at the 5* end of the subject 
y sequence and therefore, the FASTDB alignment does not show a matched/alignment of 

«P the first 10 bases at 5' end. The 10 unpaired bases represent 10% of the sequence 

P ((number of bases at the 5' and 3' ends not matched)/(total number of bases in the 

B query sequence)), so 10% is subtracted from the percent identity score calculated by (he 

^ 20 FASTDB program. If the remaining 90 bases were perfectly matched the final percent 

j3 identity would be 90%. Tn another example, a 90 base subject sequence is compared 

vB with a 100 base query sequence. This time the deletions are internal deletions so that 

^ there are no bases on the 5' or 3* of the subject sequence which are not matched/aligned 

with the query. In this case the percent identity calculated by FASTDB is not manually 
25 corrected. Once again, only bases 5* and 3' of the subject sequence which are not 
matched/aligned with the query sequnce are manually corrected for. No other manual 
corrections are to made for the purposes of the present invention. 

By a polypeptide having an amino acid sequence which is, at least, for example, 
95% "identical" to (or 5% different from) a query amino acid sequence of the present 
30 invention, it is intended that the amino acid sequence of the subject polypeptide is 

identical to the query sequence except that the subject polypeptide sequence may include 
up to five amino acid alterations per each 100 amino acids of the query amino acid 
sequence. In other words, to obtain a polypeptide having an amino acid sequence at 
least 95% identical to a query amino acid sequence, up to 5% of the amino acid residues 
35 in the subject sequence may be inserted, deleted, (insertions and deletions are 

collectively referred to as "indels" in the art) or substituted with another amino acid. 
These alterations of the reference sequence may occur at the amino- or carboxy-terminal 
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positions of the reference amino acid sequence or anywhere between those terminal 
positions, interspersed either individually among residues in the reference sequence or 
in one or more contiguous groups within the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 90%, 95%, 
5 96%, 97%, 98% or 99% identical to (or 10%, 5%, 4%, 3%, 2% or 1% different from), 
for instance, the amino acid sequences shown in SEQ ID NO:2, or that shown in SEQ 
ID NO:4, or to the amino acid sequence encoded by deposited cDNA clones, can be 
determined conventionally using known computer programs. A preferred method for 
determing the best overall match between a query sequence (a sequence of the present 
10 invention) and a subject sequence, also referred to as a global sequence alignment, can 
be determined using the FASTDB computer program based on the algorithm of Brutlag 
and colleagues (Comp. App. Biosci. 6:237-245 (1990)). In a sequence alignment the 
query and subject sequences are either both nucleotide sequences or both amino acid 



p sequences. The result of said global sequence alignment is in percent identity. 

15 Preferred parameters used in a FASTDB amino acid alignment are: Matrix=PAM 0, 

y k-tuple=2, Mismatch Penalty=l, Joining Penalty=20, Randomization Group Length=0, 

=P Cutoff Score=l, Window Size=sequence length, Gap Penalty=5, Gap Size 

f Penalty=0.05, Window Size=500 or the length of the subject amino acid sequence, 

□ whichever is shorter. 

j^J 20 If the subject sequence is shorter than the query sequence due to N- or 

C-terminal deletions, not because of internal deletions, a manual correction must be 

*S made to the results. This is becuase the FASTDB program does not account for N- and 



C-terminal truncations of the subject sequence when calculating global percent identity. 
For subject sequences truncated at the N- and C-termini, relative to the the query 

25 sequence, the percent identity is corrected by calculating the number of residues of the 
query sequence that are N- and C-terminal of the subject sequence, which are not 
matched/aligned with a corresponding subject residue, as a percent of the total bases of 
the query sequence. Whether a residue is matched/aligned is determined by results of 
the FASTDB sequence alignment This percentage is then subtracted from the percent 

30 identity, calculated by the above FASTDB program using the specified parameters, to 
arrive at a final percent identity score. This final percent identity score is what is used 
for the purposes of the present invention. Only residues to the N- and C-termini of the 
subject sequence, which are not matched/aligned with the query sequence, are 
considered for the purposes of manually adjusting the percent identity score. That is, 

35 only query residue positions outside the farthest N- and C-terminal residues of the 
subject sequence. 
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For example, a 90 amino acid residue subject sequence is aligned with a 1 00 
residue query sequence to determine percent identity. The deletion occurs at the 
N-terminus of the subject sequence and therefore, the FASTDB alignment does not 
show a matching/alignment of the first 10 residues at the N-terminus. The 10 unpaired 
5 residues represent 10% of the sequence (number of residues at the N- and C-termini not 
matched/total number of residues in the query sequence), so 10% is subtracted from the 
percent identity score calculated by the FASTDB program. If the remaining 90 residues 
were perfectly matched the final percent identity would be 90%. In another example, a 
90 residue subject sequence is compared with a 100 residue query sequence. This time 
10 the deletions are internal deletions so there are no residues at the N- or C-termini of the 
subject sequence which are not matched/aligned with the query. In this case the percent 
identity calculated by FASTDB is not manually corrected. Once again, only residue 
g positions outside the N- and C-terminal ends of the subject sequence, as displayed in 

□ the FASTDB alignment, which are not matched/aligned with the query sequnce are 

j^j 15 manually corrected for. No other manual corrections are to made for the purposes of 

y the present invention. 

■F The TL-21 and IL-22 variants may contain alterations in the coding regions, 

non-coding regions, or both. Especially preferred are polynucleotide variants 
5 containing alterations which produce silent substitutions, additions, or deletions, but do 

20 not alter the properties or activities of the encoded polypeptide. Nucleotide variants 



ce 



produced by silent substitutions due to the degeneracy of the genetic code are preferred. 
Moreover, variants in which 5-10, 1-5, or 1-2 amino acids are substituted, deleted, or 
added in any combination are also preferred. IL-21 and IL-22 polynucleotide variants 
can be produced for a variety of reasons, e.g., to optimize codon expression for a 
25 particular host (change codons in the human mRNA to those preferred by a bacterial 
host such as E. coli). 

Naturally occurring IL-21 and IL-22 variants are called "allelic variants," and 
refer to one of several alternate forms of a gene occupying a given locus on a 
chromosome of an organism (Genes II, Lewin, B., ed., John Wiley & Sons, New 
30 York (1 985)). These allelic variants can vary at either the polynucleotide and/or 

polypeptide level. Alternatively, non-naturally occurring variants may be produced by 
mutagenesis techniques or by direct synthesis. 

Using known methods of protein engineering and recombinant DN A 
. technology, variants may be generated to improve or alter the characteristics of the 
35 IL-2 1 and IL-22 polypeptides. For instance, one or more amino acids can be deleted 
from the N-terminus or C-terminus of the secreted protein without substantial loss of 
biological function. Ron and coworkers reported variant KGF proteins having heparin 
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binding activity even after deleting 3, 8, or 27 amino-terrninal amino acid residues (J. 
Biol. Chem. 268:2984-2988(1993)). Similarly, Interferon gamma exhibited up to ten . 
times higher activity after deleting 8-10 amino acid residues from the carboxy terminus 
of this protein (Dobeli, ex al, J. BiotechnoL 7:199-216 (1988)). 

In the present case, since the IL-21 and IL-22 proteins of the invention are 
highly related to the Interleukin-17-like polypeptide family, deletions of N-terminal 
amino acids up to the cysteine at position 19 of SEQ ID NO:2 and up to the cysteine at 
position 29 of SEQ ID NO:4 may retain some biological activity. Polypeptides having 
further N-terminal deletions including the cysteine- 19 residue in SEQ ID NO:2 and the 
cysteine-29 residue in SEQ ID NO:4 would not be expected to retain such biological 
activities because it is likely that these residues are required for forming a disulfide 
bridge to provide structural stability which is needed for receptor binding and signal 
transduction. 

However, even if deletion of one or more amino acids from the N-terminus of a 
protein results in modification or loss of one or more biological functions of the protein, 
other biological activities may still be retained. Thus, the ability of the shortened 
protein to induce and/or bind to antibodies which recognize the complete or mature 
IL-21 or IL-22 proteins generally will be retained when less than the majority of the 
residues of the complete or mature IL-2 1 or TL-22 proteins are removed from the 
N-termini of the respective proteins. Whether a particular polypeptide lacking 
N-terminal residues of a complete protein retains such immunologic activities can 
readily be determined by routine methods described herein and otherwise known in the 
art. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the amino acid sequence of the IL-21 
polypeptide shown in SEQ ID NO:2, up to the cysteine residue at position number 19, 
and polynucleotides encoding such polypeptides. In addition, the present invention 
further provides polypeptides having one or more residues deleted from the amino 
terminus of the amino acid sequence of the IL-22 polypeptide shown in SEQ ID NO:4, 
up to the cysteine residue at position number 29, and polynucleotides encoding such 
polypeptides. In particular, the present invention provides polypeptides comprising the 
amino acid sequence of residues n-87 of SEQ ID NO: 2, where n is an integer in the 
range of 1 to 18, and 1 9 is the position of the first residue from the N-terminus of the 
complete IL-21 polypeptide (shown in SEQ ID NO:2) believed to be required for the 
receptor binding activity of the IL-21 protein. Likewise, the present invention provides 
polypeptides comprising the amino acid sequence of residues n-160 of SEQ ID NO:4, 
where n is an integer in the range of 1 to 28, and 29 is the position of the first residue 
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from the N-terminus of the complete EL-22 polypeptide (shown in SEQ ID NO:4) 
believed to be required for the receptor binding activity of the LL-22 protein. 

More in particular, the invention provides polynucleotides encoding 
polypeptides having the amino acid sequence of residues 1-87, 2-87, 3-87, 4-87, 5-87, 

6- 87, 7-87, 8-87, 9-87, 10-87, 11-87, 12-87, 13-87, 14-87, 15-87, 16-87, 17-87, 
18-87, and 19-87 of SEQ ID NO:2. Polynucleotides encoding these polypeptides also 
are provided. The invention also provides polynucleotides encoding polypeptides 
having the amino acid sequence of residues 1-160, 2-160, 3-160, 4-160, 5-160, 6-160, 

7- 160, 8-160, 9-160, 10-160, 11-160, 12-160, 13-160, 14-160, 15-160, 16-160, 
17-160, 18-160, 19-160, 20-160, 21-160, 22-160, 23-160, 24-160, 25-160, 26-160, 
27-160, 28-160, and 29-160 of SEQ ID NO:4. Polynucleotides encoding these 
polypeptides also are provided. 

In addition, since the TL-21 and 1L-22 proteins of the invention are highly 
related to the IL-17-like polypeptide family, deletions of C-terrninal amino acids up to 
the leucine at postion 83 of SEQ ID NO:2 and up to the proline at position 129 of SEQ 
TO NO:4 may retain some biological activity. Polypeptides having further C-terminal 
deletions including the leucine residue at position 83 of SEQ ID NO:2 and the proline at 
position 129 of SEQ ID NO:4 would not be expected to retain such biological activities 
since these residues are in the beginning of the conserved domain required for 
biological activities. 

However, even if deletion of one or more amino acids from the C-terminus of a 
protein results in modification of loss of one or more biological functions of the protein, 
other biological activities may still be retained. Thus, the ability of the shortened 
protein to induce and/or bind to antibodies which recognize the complete or mature 
IL-21 and IL-22 proteins generally will be retained when less than the majority of the 
residues of the complete or mature IL-21 and IL-22 proteins are removed from the 
C-terminus. Whether a particular polypeptide lacking C-terminal residues of a complete 
protein retains such immunologic activities can readily be determined by routine 
methods described herein and otherwise known in the art. 

Accordingly, the present invention further provides polypeptides having one or 
more residues removed from the carboxy terminus of the amino acid sequence of the 
IL-21 polypeptide shown in SEQ ID NO:2, up to the leucine residue at position 83 of 
SEQ ID NO:2, and polynucleotides encoding such polypeptides. In addition, the 
present invention further provides polypeptides having one or more residues removed 
from the carboxy terminus of the amino acid sequence of the IL-22 polypeptide shown 
in SEQ ID NO:4, up to the proline residues at position 129 of SEQ ID NO:4. In 
particular, the present invention provides polypeptides having the amino acid sequence 
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of residues 1-m of the amino acid sequence in SEQ ID NO:2, where m is any integer in 
the range of 83 to 87, and residue 82 is the position of the first residue from the 
C-terminus of the complete IL-21 polypeptide (shown in SEQ ID NO:2) believed to be 
required for activity of the IL-21 protein. In addition, the present invention also 
provides polypeptides having the amino acid sequence of residues 1-m of the amino 
acid sequence in SEQ ID NO:4, where m is any integer in the range of 129 to 160, and 
residue 128 is the position of the first residue from the C-terminus of the complete 
IL-22 polypeptide (shown in SEQ ID NO:4) believed to be required for activity of the 
IL-22 protein. 

More in particular, the invention provides polynucleotides encoding 
polypeptides having the amino acid sequence of residues 1-83, 1-84, 1-85, 1-86, and 
1-87 of SEQ ID NO-2. Polynucleotides encoding these polypeptides also are provided. 
The present invention also provides polynucleotides encoding polypeptides having the 
amino acid sequence of residues 1-129, 1-130, 1-131, 1-132, 1-133, 1-134, 1-135, 
1-136, 1-137, 1-138, 1-139, 1-140, 1-141, 1-142, 1-143, 1-144, 1-145, 1-146, 
1-147, 1-148, 1-149, 1-150, 1-151, 1-152, 1-153, 1-154, 1-155, 1-156, 1-157, 
1-158, 1-159, and 1-160 of SEQ ID NO:4. Polynucleotides encoding these 
polypept ides also are provided. 

The invention also provides polypeptides having one or more amino acids 
deleted from both the amino and the carboxyl termini of IL-21, which may be described 
generally as having residues n-m of SEQ ID NO:2, where n and m are integers as 
described above. Likewise, the invention also provides polypeptides having one or 
more amino acids deleted from both the amino and the carboxyl termini of IL-22, which 
may be described generally as having residues n-m of SEQ ID NO:4, where n and m 
are integers as described above. 

Moreover, ample evidence demonstrates that variants often retain a biological 
activity similar to that of the naturally occurring protein. For example, Gayle and 
coworkers conducted extensive mutational analysis of human cytokine IL-la (/. Biol. 
Chem. 268:22105-221 1 1 (1993)). They used random mutagenesis to generate over 
3,500 individual IL-la mutants that averaged 2.5 amino acid changes per variant over 
the entire length of the molecule. Multiple mutations were examined at every possible 
amino acid position. The investigators found that "[m]ost of the molecule could be 
altered with little effect on either [binding or biological activity]" (see, Abstract). In 
. fact, only 23 unique amino acid sequences, out of more than 3,500 nucleotide 
sequences examined, produced a protein that significantly differed in activity from 
wild-type. 
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Furthermore, even if deleting one or more amino acids from the N-terminus or 
C-terminus of a polypeptide results in modification or loss of one or more biological 
functions, other biological activities may still be retained. For example, the ability of a 
deletion variant to induce and/or to bind antibodies which recognize the secreted form 
will likely be retained when less than the majority of the residues of the secreted form 
are removed from the N-terminus or C-terminus. Whether a particular polypeptide 
lacking N- or C-terminal residues of a protein retains such immunogenic activities can 
readily be determined by routine methods described herein and otherwise known in the 
art. 

As mentioned above, even if deletion of one or more amino acids from the 
N-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other biological activities may still be retained. Thus, the 
ability of the shortened protein to induce and/or bind to antibodies which recognize the 
complete or mature IL-21 or EL-22 proteins generally will be retained when less than the 
majority of the residues of the complete or mature IL-21 or IL-22 proteins arc removed 
from the N-termini of the respective proteins. Whether a particular polypeptide lacking 
N-terminal residues of a complete protein retains such immunologic activities can 
readily be determined by routine methods described herein and otherwise known in the 
art. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the amino acid sequence of the IL-21 
polypeptide shown in SEQ ID NO:2, up to the valine residue at position number 82, 
and polynucleotides encoding such polypeptides. In addition, the present invention 
further provides polypeptides having one or more residues deleted from the amino 
terminus of the amino acid sequence of the EL-22 polypeptide shown in SEQ ID NO:4, 
up to the aspartic acid residue at position number 155, and polynucleotides encoding 
such polypeptides. In particular, the present invention provides polypeptides 
comprising the amino acid sequence of residues n'-87 of SEQ ID NO: 2, where n" is an 
integer in the range of 1 to 82, and 83 is the position of the first residue from the 
N-terminus of the complete IL-21 polypeptide (shown in SEQ ID NO:2) believed to be 
required for immunogenic activity of the IL-21 protein. Likewise, the present invention 
provides polypeptides comprising the amino acid sequence of residues n"-160 of SEQ 
ID NO:4, where n" is an integer in the range of 1 to 155, and 156 is the position of the 
. first residue from the N-terminus of the complete IL-22 polypeptide (shown in SEQ ID 
NO:4) believed to be required for immunogenic activity of the LL-22 protein. 

More in particular, the invention provides polynucleotides encoding 
polypeptides comprising the amino acid sequence of residues R-2 to V-87; V-3 to V-87; 
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D-4 to V-87; T-5 to V-87; D-6 to V-87; E-7 to V-87; D-8 to V-87; R-9 to V-87; Y-10 to 
V-87; P-1 1 to V-87; Q-12 to V-87; K-13 to V-87; L-14 to V-87; A-t5 to V-87; F-16 to 
V-87; A-17 to V-87; E-18 to V-87; C-19 to V-87; L-20 to V-87; C-21 to V-87; R-22 to 
V-87; G-23 to V-87; C-24 to V-87; 1-25 to V-87; D-26 to V-87; A-27 to V-87; R-28 to 
5 V-87; T-29 to V-87; G-30 to V-87; R-3 1 to V-87; E-32 to V-87; T-33 to V-87; A-34 to 
V-87; A-35 to V-87; L-36 to V-87; N-37 to V-87; S-38 to V-87; V-39 to V-87; R-40 to 
V-87; L-41 to V-87; L-42 to V-87; Q-43 to V-87; S-44 to V-87; L-45 to V-87; L^6 to 
V-87; V-47 to V-87; L^8 to V-87; R-49 to V-87; R-50 to V-87; R-51 to V-87; P-52 to 
V-87; C-53 to V-87; S-54 to V-87; R-55 to V-87; D-56 to V-87; G-57 to V-87; S-58 to 
10 V-87; G^59 to V-87; L-60 to V-87; P-61 to V-87; T-62 to V-87; P-63 to V-87; G-64 to 
V-87; A-65 to V-87; F-66 to V-87; A-67 to V-87; F-68 to V-87; 11-69 to V-87; T-70 to 
V-87; E-7 1 to V-87; F-72 to V-87; 1-73 to V-87; H-74 to V-87; V-75 to V-87; P-76 to 
^ V-87; V-77 to V-87; G-78 to V-87; C-79 to V-87; T-80 to V-87; C-8 1 to V-87; and 

□ V-82 to V-87 of SEQ ID NO:2. Polynucleotides encoding these polypeptides also are 

J® 15 provided. 

y Further, the invention provides polynucleotides encoding polypeptides 

~ comprising the amino acid sequence of residues S-2 to P-1 60; A-3 to P-1 60; R-4 to 
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P-160; A-5 to P-160; R-6 to P-160; A-7 to P-160; V-8 to P-160; L-9 to P-160; S-10 to 
P P-160; A-1 1 to P-160; F-12 to P-160; H-13 to P-160; H-14 to P-160; T-15 to P-160; 

20 L-16 to P-160; Q-17 to P-160; L-18 to P-160; G-19 to P-160; P-20 to P-160; R-21 to 
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ru 

£ P-160; E-22 to P-160; Q-23 to P-160; A-24 to P-160; R-25 to P-160; N-26 to P-160; 

5 A-27 to P-160; S-28 to P-160; C-29 to P-160; P-30 to P-160; A-31 to P-160; G-32 to 

P-160; G-33 to P-160; R-34 to P-160; P-35 to P-160; A-36 to P-160; D-37 to P-160; 
R-38 to P-160; R-39 to P-160; F-40 to P-160; R-41 to P-160; P-42 to P-160; P-43 to 
25 P-160; T-44 to P- 1 60; N-45 to P- 1 60; L-46 to P- 1 60; R-47 to P- 1 60; S-48 to P- 1 60; 
V-49 to P-160; S-50 to P-160; P-51 to P-160; W-52 to P-160; A-53 to P-160; Y-54 to 
P-160; R-55 to P-160; 1-56 to P-160; S-57 to P-160; Y-58 to P-160; D-59 to P-160; 
P-60 to P-160; A-61 to P-160; R-62 to P-160; Y-63 to P-160; P-64 to P-160; R-65 to 
P-160; Y-66 to P-160; L-67 to P-160; P-68 to P-160; E-69 to P-160; A-70 to P-160; 
30 Y-71 to P-160; C-72 to P-160; L-73 to P-160; C-74 to P-160; R-75 to P-160; G-76 to 
P-160; C-77 to P-160; L-78 to P-160; T-79 to P-160; G-80 to P-160; L-81 to P-160; 
F-82 to P-160; G-83 to P-160; E-84 to P-160; E-85 to P-160; D-86 to P-160; V-87 to 
P-160; R-88 to P-160; F-89 to P-160; R-90 to P-160; S-91 to P-160; A-92 to P-160; 
P-93.to P-160; V-94 to P-160; Y-95 to P-160; M-96 to P-160; P-97 to P-160; T-98 to 
35 P-160; V-99 to P-160; V-100 to P-160; L-101 to P-160; R-102 to P-160; R-103 to 
P-160; T-104 to P-160; P-105 to P-160; A-106 to P-160; C-107 to P-160; A-108 to 
P-160; G-109 to P-160; G-l 10 to P-160; R-l 1 1 to P-1 60; S-l 12 to P-160; V-l 1 3 to 
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P-160; Y-l 14 to P-160; T-115 to P-160; E-l 16 to P-160; A-117 to P-160; Y-U8 to 
P-160; V-1 19 to P-160; T- 120 to P-160; 1-121 to P-160; P-122 to P-160; V-123 to 
P-160; G-124 to P-160; C-125 to P-160; T-126 to P-160; C-127 to P-160; V-128 to 
P-160; P-129 to P-160; E-130to P-160; P-131 to P-160; E-132 to P-160; K-133 to 
P-160; D-134 to P-160; A-135 to P-160; D-136 to P-160; S-137 to P-160; 1-138 to 
P-160; N-139 to P-160; S-140 to P-160; S-141 to P-160; T-142 to P-160; D-143 to 
P-160; K-144 to P-160; Q-145 to P-160; G-146 to P-160; A-147 to P-160; K-148 to 
P-160; L-149 to P-160; L-150 to P-160; L-151 to P-160; G-152 to P-160; P-153 to 
P-160; N-154 to P-160; and D-155 to P-160 of SEQ ID NO:4. Polynucleotides 
encoding these polypeptides also are provided. 

Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification of loss of one or more biological 
functions of the protein, other biological activities may still be retained. Thus, the 
ability of the shortened protein to induce and/or bind to antibodies which recognize the 
complete or mature IL-2 1 and IL-22 proteins generally will be retained when less than 
the majority of the residues of the complete or mature 1L-21 and TL-22 proteins are 
removed from the C-terrninus. Whether a particular polypeptide lacking C-tcrminal 
residues of a complete protein retains such immunologic activities can readily be 
determined by routine methods described herein and otherwise known in the art. 

Accordingly, the present invention further provides polypeptides having one or 
more residues removed from the carboxy terminus of the amino acid sequence of the 
TL-21 polypeptide shown in SEQ ID NO:2, up to the aspartic acid residue at position 6 
of SEQ ID NO:2, and polynucleotides encoding such polypeptides. In addition, the 
present invention further provides polypeptides having one or more residues removed 
from the carboxy terminus of the amino acid sequence of the TL-22 polypeptide shown 
in SEQ ID NO:4, up to the arginine residues at position 6 of SEQ TD NO:4. In 
particular, the present invention provides polypeptides having the amino acid sequence 
of residues 1-m' of the amino acid sequence in SEQ ID NO:2, where m' is any integer 
in the range of 6 to 87, and residue 5 is the position of the first residue from the 
C-terminus of the complete TL-21 polypeptide (shown in SEQ ID NO:2) believed to be 
required for immunogenic activity of the IL-21 protein. In addition, the present 
invention also provides polypeptides having the amino acid sequence of residues 1-m" 
of the amino acid sequence in SEQ ID NO:4, where m" is any integer in the range of 6 
to 160, and residue 5 is the position of the first residue from the C-terminus of the 
complete IL-22 polypeptide (shown in SEQ ID NO:4) believed to be required for 
immunogenic activity of the IL-22 protein. 
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More in particular, the invention provides polynucleotides encoding 
polypeptides having the amino acid sequence of residues A-l to S-86; A-l to R-85; A-l 
to P-84; A-l to L-83; A-l to V-82; A-l to C-81; A-l to T-80; A-l to C-79; A-l to G-78; 
A-l to V-77; A-l to P-76; A-l to V-75; A-l to H-74; A-l to 1-73; A-l to F-72; A-l to 
E-71; A-l to T-70; A-l to H-69; A-l to F-68; A-l to A-67; A-l to F-66; A-l to A-65; 
A-l to G-64; A-l to P-63; A-l to T-62; A-l to P-61; A-l to L-60; A-l to G-59; A-l to 
S-58; A-l to G-57; A-l to D-56; A-l to R-55; A-l to S-54; A-l to C-53; A-l to P-52; 
A-l toR-51; A-l to R-50; A-l to R-49; A-l toL-48; A-l to V-47; A-l toL-46; A-l to 
L^5; A-l to S-44; A-l to Q-43; A-l to L-42; A-l to L-41 ; A-l to R-40; A-l to V-39; 
A-l to S-38; A-l to N-37; A-l to L-36; A-l to A-35; A-l to A-34; A-l to T-33; A- 1 to 
E-32; A-l to R-31; A-l to G-30; A-l to T-29; A-l to R-28; A-l to A-27; A-l to D-26; 
A-l to 1-25; A-l to C-24; A-l to G-23; A-l to R-22; A-l to C-21 ; A-l to L-20; A-l to 
C-19; A-l toE-18; A-l to A-17; A-l toF-16; A-l to A-15; A-l toL-14; A-l to K-13; 
A-l to Q-12; A-l to P-l I; A-l to Y-10; A-l to R-9; A-l to D-8; A-l to E-7; and A-l to 
D-6 of SEQ ID NO:2. Polynucleotides encoding these polypeptides also are provided. 

Moreover, the invention also provides polynucleotides encoding polypeptides 
having the amino acid sequence of residues N-l to G-159; N-l to A- 1 58; N-l to P-l 57; 
N-l to A-156;N-1 to D-155;N-I toN-154; N-l toP-153;N-l to G-152; N-l to 
L-151; N-l to L-150; N-l to L-149; N-l to K-148; N-l to A-147; N-l to G-146; N-l 
to Q-145; N-l to K-144; N-l to D-143; N-l to 1-142; N-l to S-141 ; N-l to S-140; N-l 
toN-139; N-l to 1-138; N-l to S-137; N-l to D-136; N-l to A-135; N-l to D-134; N-l 
to K-133;N-1 toE-132;N-l to P-131;N-1 toE-130; N-l to P-129; N-l toV-128; N-l 
to C-127; N-l to T-126; N-l to C-125; N-l to G-124; N-l to V-123; N-l to P-122; 
N-l toI-121;N-l to T-120; N-l to V-119; N-l toY-H8;N-l to A-117;N-l toE-116; 
N-l toT-115;N-l to Y-114; N-l to V-113;N-1 toS-112;N-l to R-lll; N-l to 
G-l 10; N-l to G-109; N-l to A-108; N-l to C-107; N-l to A-106; N-l to P-105; N-l 
toT-104; N-l loR-103;N-l toR-102; N-l to L-101; N-l toV-100; N-l toV-99;N-l 
to T-98; N-l to P-97; N-l to M-96; N-l to Y-95; N-l to V-94; N-l to P-93; N-l to 
A-92; N-l to S-91; N-l to R-90; N-l to F-89; N-l to R-88; N-l to V-87; N-l to D-86; 
N-l to E-85; N-l to E-84; N-l to G-83; N-l to F-82; N-l to L-81; N-l to G-80; N-l to 
T-79; N-l to L-78; N-l to C-77; N-l to G-76; N-l to R-75; N-l to C-74; N-l to L-73; 
N-l toC-72;N-ltoY-7l;N-l toA-70;N-l toE-69; N-l toP-68;N-l toL-67; N-l to 
Y-66; N-l to R-65; N-l to P-64; N-l to Y-63; N-l to R-62; N-l to A-61; N-l to P-60; 
N-l to D-59; N-l to Y-58; N-l to S-57; N-l to 1-56; N-l to R-55; N-l to Y-54; N-l to 
A-53; N-l to W-52; N-l to P-51; N-l to S-50; N-l to V-49; N-l to S-48; N-l to R-47; 
N-l to L-46; N-l to N-45; N-l to T-44; N-l to P-43; N-l to P-42; N-l to R-41; N-l to 
F-40; N-l to R-39; N-l to R-38; N-l to D-37; N-l to A-36; N-l to P-35; N-l to R-34; 
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N-l toG-33;N-l toG-32;N-i to A-31;N-1 to P-30; N-l to C-29; N-1 to S-28; N-1 to 
A-27; N-l to N-26; N-l to R-25; N-l to A-24; N-l to Q-23; N-l to E-22; N-l to R-21; 
N-l to P-20; N-l to G-19; N-l to L-18; N-l to Q-17; N-l to L-16; N-l to T-15; N-l to 
H- 14; N-l to H-13;N-1 toF-12;N-l to A-11;N-1 to S-10; N-l toL-9; N-l to V-8; 
N-l to A-7; and N-l to R-6 of SEQ ID NO:4. Polynucleotides encoding these 
polypeptides also are provided. 

The invention also provides polypeptides having one or more amino acids 
deleted from both the amino and the carboxyl termini of 1L-21, which may be described 
generally as having residues n'-m' of SEQ ID NO: 2, where n' and m' are integers as 
described above. Likewise, the invention also provides polypeptides having one or 
more amino acids deleted from both the amino and the carboxyl termini of IL-22, which 
may be described generally as having residues n"-rn" of SEQ ID NO:4, where n" and 
m" are integers as described above. 

Moreover, any polypeptide having one or more amino acids deleted from both 
the amino and the carboxyl termini of iL-22, described specifically as having residues 
n"-m" of SEQ ID NO:4 (where n" and m" are integers as described above) may be 
excluded from the invention. In particular, any polypeptide having one or more amino 
acids deleted from both the amino and the carboxyl termini of IL-22 and which is 
defined by residues n"-m M of SEQ ID NO:4, where n" is equal to 21, 22, 23, 24 or 25 
and m" is equal to 271, 272, 273, 274, 275 or 276 may be excluded from the 
invention. 

The invention further includes IL-21 and IL-22 polypeptide variants which 
show substantial biological activity. Such variants include deletions, insertions, 
inversions, repeats, and substitutions selected according to general rules known in the 
art so as have little effect on activity. For example, guidance concerning how to make 
phenotypically silent amino acid substitutions is provided by Bowie and colleagues 
(Science 247:1306-1310 (1990)), wherein the authors indicate that there are two main 
strategies for studying the tolerance of an amino acid sequence to change. 

The first strategy exploits the tolerance of amino acid substitutions by natural 
selection during the process of evolution. By comparing amino acid sequences in 
different species, conserved amino acids can be identified. These conserved amino 
acids are likely important for protein function. In contrast, the amino acid positions 
where substitutions have been tolerated by natural selection indicates that these 
positions are not critical for protein function. Thus, positions tolerating amino acid 
substitution could be modified while still maintaining biological activity of the protein. 

The second strategy uses genetic engineering to introduce amino acid changes at 
specific positions of a cloned gene to identify regions critical for protein function. For 
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example, site directed mutagenesis or alaninc-scanning mutagenesis (introduction of 
single alanine mutations at every residue in the molecule) can he used (Cunningham and 
Wells, Science 244:1081-1085 (1989)). The resulting mutant molecules can then be 
tested for biological activity. 

5 As the authors state, these two strategies have revealed that proteins are 

surprisingly tolerant of amino acid substitutions. The authors further indicate which 
amino acid changes are likely to be permissive at certain amino acid positions in the 
protein. For example, most buried (within the tertiary structure of the protein) amino 
acid residues require nonpolar side chains, whereas few features of surface side chains 

10 are generally conserved. Moreover, tolerated conservative amino acid substitutions 
involve replacement of an aliphatic or hydrophobic amino acid with another aliphatic or 
hydrophobic amino acid such as Ala, Val, Leu or lie; replacement of a hydroxyl residue 
with another hydroxyl residue such as Ser or Thr; replacement of an acidic residue with 



3 another acidic residue such as Asp or Glu; replacement of an amide residue with another 

^ 1 5 amide residue such as Asn or Gin, replacement of a basic residue with another basic 
j residue such as Lys, Arg, or His; replacement of an aromatic residue with another 

P aromatic residue such as Phe, Tyr, or Trp, and replacement of a small-sized amino acid 

t with another small -sized residue such as Ala, Ser, Thr, Met, or Gly. 

3 Besides conservative amino acid substitution, variants of IL-21 and IL-22 

J! 20 include (i) substitutions with one or more of the non-conserved amino acid residues, 
g where the substituted amino acid residues may or may not be one encoded by the 

0 genetic code, or (ii) substitution with one or more of amino acid residues having a 



substituent group, or (iii) fusion of the mature polypeptide with another compound, 
such as a compound to increase the stability and/or solubility of the polypeptide (for 

25 example, polyethylene glycol), or (iv) fusion of the polypeptide with additional amino 
acids, such as an IgG Fc fusion region peptide, or leader or secretory sequence, or a 
sequence facilitating purification. Such variant polypeptides are deemed to be within 
the scope of those skilled in the art from the teachings herein. 

For example, TL-2 1 and 11.-22 polypeptide variants containing amino acid 

30 substitutions of charged amino acids with other charged or neutral amino acids may 
produce proteins with improved characteristics, such as less aggregation. Aggregation 
of pharmaceutical formulations both reduces activity and increases clearance due to the 
aggregate's immunogenic activity (Pinckard, et til, Clin. Exp. Immunol. 2:331-340 
(1967); Robbins, et «/., Diabetes 36:838-845 (1987); Cleland, etal, Crit Rev. Ther. 

35 Drug Carrier Systems 10:307-377 (1993)). 
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Polynucleotide and Polypeptide Fragments 

The invention provides nucleic acid molecules having nucleotide sequences 
related to extensive portions of SEQ ID NO:3 which have been determined from the 
following related cDNA clones: HE2CD08R (SEQ ID NO:24); HAGBX04R (SEQ ID 
5 NO:25); HCEBA24FB (SEQ ID NO:26); and HCELE59R (SEQ ID NO:27). 

In the present invention, a "polynucleotide fragment" refers to a short 
polynucleotide having a nucleic acid sequence contained in the deposited clones or 
shown in SEQ ID NO: 1 or SEQ ID NO:3. The short nucleotide fragments are 
preferably at least about 15 nt, and more preferably at least about 20 nt, still more 

10 preferably at least about 30 nt, and even more preferably, at least about 40 nt in length. 
A fragment "at least 20 nt in length," for example, is intended to include 20 or more 
contiguous bases from the cDNA sequence contained in the deposited clones or the 
nucleotide sequences shown in SEQ ID NO:l and SEQ ID NO:3. These nucleotide 
fragments are useful as diagnostic probes and primers as discussed herein. Of course, 

15 larger fragments (e.g., 50, 1 50, 500, 600, 2000 nucleotides) are preferred. 

Moreover, representative examples of 1L-21 polynucleotide fragments include, 
for example, fragments having a sequence from about nucleotide number 1-50, 51-100, 
101-150, 151-200, 201-250, 251-300, 301-350, 351-400, 401-450, 451-500, 
501-550, 551-600, 651-700. or 701 to the end of SEQ ID NO:l or the cDNA contained 

20 in the deposited clone. In addition, representative examples of IL-22 polynucleotide 
fragments include, for example, fragments having a sequence from about nucleotide 
number 1-50, 51-100, 101-150, 151-200, 201-250, 251-300, 301-350, 351-400, 
401-450, 451-500, 501-550, 551-600, 601-650, 651-700, 701-750, 751-800, 
801-850, 851-900, 901-950, 951-1000, 1001-1050, 1051-1100, 1101-1150, 

25 1151-1200, .1201-1250, 1251-1300, 1301-1350, 1351-1400, 1401-1450, 1451-1500, 
1551-1600, or 1601 to the end of SEQ ID NO:3 or the cDNA contained in the deposited 
clone. In this context "about" includes the particularly recited ranges, larger or smaller 
by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both termini. 
Preferably, these fragments encode a polypeptide which has biological activity. 

30 Further, the invention includes a polynucleotide comprising any portion of at 

least about 30 nucleotides, preferably at least about 50 nucleotides, of SEQ ID NO: 1 
from residue 1 to 650, 25 to 650, 50 to 650, 75 to 650, 100 to 650, 125 to 650, 150 to 
650, 175 to 650, 200 to 650, 225 to 650, 250 to 650, 275 to 650, 300 to 650, 325 to 
650, 350 to 650, 375 to 650, 400 to 650, 425 to 650, 500 to 650, 525 to 650, 550 to 

35 650, 575 to 650, 600 to 650, 625 to 650, 1 to 600, 25 to 600, 50 to 600, 75 to 600, 
100 to 600, 125 to 600, 150 to 600, 175 to 600, 200 to 600, 225 to 600, 250 to 600, 
275 to 600. 300 to 600, 325 to 600, 350 to 600, 375 to 600. 400 to 600, 425 to 600, 
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500 to 600, 525 to 600, 550 to 600, 575 to 600, 1 to 550, 25 to 550, 50 to 550, 75 to 
550, 100 to 550, 125 to 550, 150 to 550, 175 to 550, 200 to 550, 225 to 550, 250 to 
550, 275 to 550, 300 to 550, 325 to 550, 350 to 550, 375 to 550, 400 to 550, 425 to 
550, 500 to 550, 525 to 550, 1 to 500, 25 to 500, 50 to 500, 75 to 500, 100 to 500, 
125 to 500, 150 to 500, 175 to 500, 200 to 500, 225 to 500, 250 to 500, 275 to 500, 
300 to 500, 325 to 500, 350 to 500, 375 to 500, 400 to 500, 425 to 500, 450 to 500, 
475 to 500, 1 to 450, 25 to 450, 50 to 450, 75 to 450, 100 to 450, 125 to 450, 150 to 
450, 175 to 450, 200 to 450, 225 to 450, 250 to 450. 275 to 450, 300 to 450, 325 to 
450, 350 to 450, 375 to 450, 400 to 450, 425 to 450, 1 to 400, 25 to 400, 50 to 400, 
75 to 400, 100 to 400, 125 to 400, 150 to 400, 175 to 400, 200 to 400, 225 to 400, 
250 to 400, 275 to 400, 300 to 400, 325 to 400, 350 to 400, 375 to 400, 1 to 350, 25 
to 350, 50 to 350, 75 to 350, 100 to 350, 125 to 350, 150 to 350, 175 to 350, 200 to 
350, 225 to 350, 250 to 350, 275 to 350, 300 to 350, 325 to 350, 1 to 300, 25 to 300, 
50 to 300, 75 to 300, 100 to 300, 125 to 300, 150 to 300, 175 to 300, 200 to 300, 225 
to 300, 250 to 300, 275 to 300, 1 to 250, 25 to 250, 50 to 250, 75 to 250, 100 to 250, 
125 to 250, 150 to 250, 175 to 250, 200 to 250, 225 to 250, 1 to 200, 25 to 200, 50 to 
200, 75 to 200, 100 to 200, 125 to 200, 150 to 200, 175 to 200, 1 to 150, 25 to 150, 
50 to 150, 75 to 150, 100 to 150, 125 to 150, 1 to 100, 25 to 100, 50 to 100, 75 to 
100, 1 to 50, and 25 to' 50. 

Moreover, the invention includes a polynucleotide comprising any portion of at 
least about 30 nucleotides, preferably at least about 50 nucleotides, of SEQ ID NO:3 
from residue 300 to 850. More preferably, the invention includes a polynucleotide 
comprising nucleotide residues 50 to 850, 75 to .850, 100 to 850, 125 to 850, 150 to 
850, 175 to 850, 200 to 850, 225 to 850, 250 to 850, 275 to 850, 300 to 850, 325 to 
850, 350 to 850, 375 to 850, 400 to 850, 425 to 850, 450 to 850, 475 to 850, 500 to 
850, 525 to 850, 550 to 850, 575 to 850, 600 to 850, 625 to 850, 650 to 850, 675 to 
850, 700 to 850, 750 to 850, 775 to 850, 800 to 850, 50 to 800, 75 to 800, 100 to 
800, 125 to 800, 150 to 800, 175 to 800, 200 to 800, 225 to 800, 250 to 800, 275 to 
800, 300 to 800, 325 to 800, 350 to 800, 375 to 800, 400 to 800, 425 to 800, 450 to 
800, 475 to 800, 500 to 800, 525 to 800, 550 to 800, 575 to 800, 600 to 800, 625 to 
800, 650 to 800, 675 to 800, 700 to 800, 750 to 800, 50 to 750, 75 to 750, 100 to 
750, 125 to 750, 150 to 750, 175 to 750, 200 to 750, 225 to 750, 250 to 750, 275 to 
750, 300 to 750, 325 to 750, 350 to 750, 375 to 750, 400 to 750, 425 to 750, 450 to 
750, 475 to 750, 500 to 750, 525 to 750, 550 to 750, 575 to 750, 600 to 750, 625 to 
750, 650 to 750, 675 to 750, 700 to 750, 50 to 700, 75 to 700, 100 to 700, 125 to 
700, 150 to 700, 175 to 700, 200 to 700, 225 to 700, 250 to 700, 275 to 700, 300 to 
700, 325 to 700, 350 to 700, 375 to 700, 400 to 700, 425 to 700, 450 to 700, 475 to 
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700, 500 to 700, 525 to 700, 550 to 700, 575 to 700, 600 to 700, 625 to 700, 650 to 
700, 50 to 650, 75 to 650, 100 to 650, 125 to 650, 150 to 650, 175 to 650, 200 to 
650, 225 to 650, 250 to 650, 275 to 650, 300 to 650, 325 to 650, 350 to 650, 375 to 
650, 400 to 650, 425 to 650, 450 to 650, 475 to 650, 500 to 650, 525 to 650, 550 to 
650, 575 to 650, 600 to 650, 50 to 600, 75 to 600, 100 to 600, 125 to 600, 150 to 
600, 175 to 600, 200 to 600, 225 to 600, 250 to 600, 275 to 600, 300 to 600, 325 to 
600, 350 to 600, 375 to 600, 400 to 600, 425 to 600, 450 to 600, 475 to 600, 500 to 
600, 525 to 600, 550 to 600, 50 to 550, 75 to 550, 100 to 550, 125 to 550, 150 to 
550, 175 to 550, 200 to 550, 225 to 550, 250 to 550, 275 to 550, 300 to 550, 325 to 
550, 350 to 550, 375 to 550, 400 to 550, 425 to 550, 450 to 550, 475 to 550, 500 to 
550, 50 to 500, 75 to 500, 100 to 500, 125 to 500, 150 to 500, 175 to 500, 200 to 
500, 225 to 500, 250 to 500, 275 to 500, 300 to 500, 325 to 500, 350 to 500, 375 to 
500, 400 to 500, 425 to 500, 450 to 500, 50 to 450, 75 to 450, 100 to 450, 125 to 
450, 150 to 450, 175 to 450, 200 to 450, 225 to 450, 250 to 450, 275 to 450, 300 to 
450, 325 to 450, 350 to 450, 375 to 450, 400 to 450, 50 to 400, 75 to 400, 100 to 
400, 125 to 400, 150 to 400, 175 to 400, 200 to 400, 225 to 400, 250 to 400, 275 to 
400, 300 to 400, 325 to 400, 350 to 400, 50 to 350, 75 to 350, 100 to 350, 125 to 
350, 150 to 350, 175 to 350, 200 to 350, 225 to 350, 250 to 350, 275 to 350, 300 to 
350, 50 to 300, 75 to 300, 100 to 300, 125 to 300, 150 to 300, 175 to 300, 200 to 
300, 225 to 300, and 250 to 300. 

In the present invention, a "polypeptide fragment" refers to a short amino acid 
sequence contained in SEQ ID NO:2 or SEQ ID NO:4 or encoded by the cDNAs 
contained in the deposited clones. Protein fragments may be "free-standing," or 
comprised within a larger polypeptide of which the fragment forms a part or region, 
most preferably as a single continuous region. Representative examples of polypeptide 
fragments of the TL-21 invention, include, for example, fragments from about amino 
acid number 1-20, 21-40, 41-60, 61-83 or to the end of the coding region. Moreover, 
polypeptide fragments of IL-21 can be about 10, 20, 30, 40, 50, 60, 70, or 80 amino 
acids in length. Representative examples of polypeptide fragments of the IL-22 
invention, include, for example, fragments from about amino acid number 1-20, 21-40, 
41-60, 61-80, 81-100, 100-120, 120-140, 140-160, or to the end of the coding region. 
Moreover, polypeptide fragments of IL-22 can be about 10, 20, 30, 40, 50, 60, 70, 80, 
100, 120, 140, or 150 amino acids in length. Tn this context "about" includes the 
particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1) amino acids, at 
either extreme or at both extremes. 

Preferred polypeptide fragments include the secreted IL-21 and IL-22 proteins 
as well as the mature forms. Further preferred polypeptide fragments include the 
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secreted IL-21 and TL-22 proteins or the mature forms having a continuous series of 
deleted residues from the amino or the carboxy terminus, or both. For example, any 
number of amino acids, ranging from 1 -60, can be deleted from the amino terminus of 
either the secreted or the mature form of the IL-21 and IL-22 polypeptides. Similarly, 
any number of amino acids, ranging from 1-30, can be deleted from the carboxy 
terminus of the secreted or the mature form of the IL-2 1 and IL-22 polypeptides. 
Furthermore, any combination of the above amino and carboxy terminus deletions are 
preferred. Similarly, polynucleotide fragments encoding these TL-21 and IL-22 
polypeptide fragments are also preferred. 

Also preferred are IL-21 and EL-22 polypeptide and polynucleotide fragments 
characterized by structural or functional domains, such as fragments that comprise 
alpha-helix and alpha-helix forming regions, beta-sheet and bcta-sheet-forming regions, 
turn and turn-forming regions, coil and coil-forming regions, hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
regions, surface-forming regions, substrate binding region, and high antigenic index 
regions. Polypeptide fragments of SEQ ID NO:2 and SEQ ID NO:4 falling within 
conserved domains are specifically contemplated by the present invention (Figures 4 
and 5, respectively). Moreover, polynucleotide fragments encoding these domains arc 
also contemplated. 

Other preferred fragments are biologically active IL-21 and IL-22 fragments. 
Biologically active fragments are those exhibiting activity similar, but not necessarily 
identical, to an activity of the IL-21 and IL-22 polypeptides. The biological activity of 
the fragments may include an improved desired activity, or a decreased undesirable 
activity. 

Epitopes & Antibodies 

In the present invention, "epitopes" refer to IL-21 and IL-22 polypeptide 
fragments having antigenic or immunogenic activity in an animal, especially in a 
human. A preferred embodiment of the present invention relates to an IL-21 or TL-22 
polypeptide fragment comprising an epitope, as well as the polynucleotide encoding this 
fragment. A region of a protein molecule to which an antibody can bind is defined as 
an "antigenic epitope". In contrast, an "immunogenic epitope" is defined as a part of a 
protein that elicits an antibody response (see, for instance, Gey sen, et ai t Proc. Natl. 
Acad. ScL USA 81:3998- 4002 (1983)). 

Fragments which function as epitopes may be produced by any conventional 
means (see, e.g., Houghten, R. A., Proc. Natl. Acad. ScL USA 82:5131-5135 
(1985); further described in U.S. Patent No. 4,631,211). 
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Tn the present invention, antigenic epitopes preferably contain a sequence of at 
least seven, more preferably at least nine, and most preferably between about 15 to 
about 30 amino acids. Antigenic epitopes are useful to raise antibodies, including 
monoclonal antibodies, that specifically bind the epitope (see, for instance, Wilson, et 
al„ Cell 37:767-778 (1984); Sutcliffe, J. G. et «/., Science 219:660-666 (1983)). 

Similarly, immunogenic epitopes can be used to induce antibodies according to 
methods well known in the art (see, for instance, Sutcliffe, et al t supra; Wilson, etal, 
supra; Chow, M, et al., Proc. Natl. Acad Set. USA 82:910-914; and Bittle, F. J., et 
al, J. Gen. Virol. 66:2347-2354 (1985)). A preferred immunogenic epitope includes 
the secreted protein. The immunogenic epitopes may be presented together with a 
carrier protein, such as an albumin, to an animal system (such as rabbit or mouse) or, if 
it is long enough (at least about 25 amino acids), without a carrier. However, 
immunogenic epitopes comprising as few as 8 to 10 amino acids have been shown to be 
sufficient to raise antibodies capable of binding to, at the very least, linear epitopes in a 
denatured polypeptide (e.g., in Western blotting). 

Using DNAstar analysis, SEQ ID NO:2 was found antigenic at amino acids: 
from about Arg-2 to about Pro-1 1, from about Cys-24 to about Glu-32, and from about 
Arg-51 to about Gly-59. Thus, these regions can be used as epitopes to produce 
antibodies against the protein encoded by HTGED19. Again using DNAstar analysis, 
SEQ ID NO;4 was found antigenic at amino acids: from about Gly-19 to about Ala-27, 
from about Pro-30 to about Arg-38, from about Phe-40 to about Ser-48, from about 
Tyr-58 to about Leu-67, from about Pro-105 to about Val-1 1 3, from about Pro-129 to 
about Ser-137, from about Asn-139 to about Ala- 147, and from about Leu-151 to about 
Gly-159. Thus, these regions can be used as epitopes to produce antibodies against the 
protein encoded by HFPBX96. 

As used herein, the term "antibody" (Ab) or "monoclonal ami body" (Mab) is 
meant to include intact molecules as well as antibody fragments (such as, for example, 
Fab and F(ab')2 fragments) which are capable of specifically binding to protien. Fab 
and F(ab')2 fragments lack the Fc fragment of intact antibody, clear more rapidly from 
the circulation, and may have less non-specific tissue binding than an intact antibody 
(Wahl, etal, J. Nucl Med. 24:316-325 (1983)). Thus, these fragments are preferred, 
as well as the products of a FAB or other immunoglobulin expression library. 
Moreover, antibodies of the present invention include chimeric, single chain, and 
humanized antibodies. 
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Fusion Proteins 

Any IL-21 or IL-22 polypeptide can be used to generate fusion proteins. For 
example, the IL-21 or IL-22 polypeptides, when fused to a second protein, can be used 
as an antigenic tag. Antibodies raised against the TL-21 or IL-22 polypeptides can be 
5 used to indirectly detect a second protein by binding to IL-21 or IL-22, respectively. 
Moreover, because secreted proteins target cellular locations based on trafficking 
signals, the IL-21 and IL-22 polypeptides can be used as targeting molecules once 
fused to other proteins. 

Examples of domains that can be fused to the EL-21 and IL-22 polypeptides 
10 include not only heterologous signal sequences, but also other heterologous functional 
regions. The fusion does not necessarily need to be direct, but may occur through 
linker sequences. 

Moreover, fusion proteins may also be engineered to improve characteristics of 
□ the IL-21 and IL-22 polypeptides. For instance, a region of additional amino acids, 

5J 15 particularly charged amino acids, may be added to the N-terminus of the IL-2 1 and 

y IL-22 polypeptides to improve stability and persistence during purification from the 

£ host cell or during subsequent handling and storage. Also, peptide moieties may be 

s added to the IL-21 and IL-22 polypeptides to facilitate purification. Such regions may 

O be removed prior to final preparation of the IL-2 1 and IL-22 polypeptides. The addition 

|y 20 of peptide moieties to facilitate handling of polypeptides are familiar and routine 

techniques in the art. 

^ Moreover, TL-21 and IL-22 polypeptides, including fragments, and specifically 

epitopes, can be comt : ned with parts of the constant domain of immunoglobulins 
(IgG), resulting in chimeric polypeptides. These fusion proteins facilitate purification 

25 and show an increased half-life in vivo. One reported example describes chimeric 

proteins consisting of the first two domains of the human CD4-polypeptide and various 
domains of the constant regions of the heavy or light chains of mammalian 
immunoglobulins (EP A 394,827; Traunecker, et al, Nature 331:84-86 (1988)). 
Fusion proteins having disulfide-linked dimeric structures (due to the IgG) can also be 

30 more efficient in binding and neutralizing other molecules, than the monomeric secreted 
protein or protein fragment alone (Fountoulakis, et al t 7. Biochem. 270:3958-3964 
(1995)). 

Similarly, EP-A-0 464 533 (Canadian counterpart 2045869) discloses fusion 
proteins comprising various portions of constant region of immunoglobulin molecules 
35 together with another human protein or part thereof. In many cases, the Fc part in a 
fusion protein is beneficial in therapy and diagnosis, and thus can result in, for 
example, improved pharmacokinetic properties (EP-A 0232 262). Alternatively, 
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deleting the Fc pan after the fusion protein has been expressed, detected, and purified, 
would be desired. For example, the Fc portion may hinder therapy and diagnosis if the 
fusion protein is used as an antigen for immunizations. In drug discovery, for 
example, human proteins, such as hlL-5, have been fused with Fc portions for the 
purpose of high-throughput screening assays to identify antagonists of hIL-5 (see, 
Bennett, D., et ai, J. Mol Recog. 8:52-58 (1995); Johanson, K., et ai, J. Biol. 
Chem. 270:9459-9471 (1995)). 

Moreover, the IL-21 and IL-22 polypeptides can be fused to marker sequences, 
such as a peptide which facilitates purification of IL-21 and IL-22, respectively. In 
preferred embodiments, the marker amino acid sequence is a hexa-histidine peptide, 
such as the tag provided in a pQE vector (Q1AGEN, Inc., 9259 Eton Avenue, 
Chatsworth, CA, 91311), among others, many of which are commercially available. 
As described by Gentz and coworkers (Proc. Natl. Acad. Sci. USA 86:821-824 
(1989)), for instance, hexa-histidine provides for convenient purification of the fusion 
protein. Another peptide tag useful for purification, the "HA" tag, corresponds to an 
epitope derived from the influenza hemagglutinin protein (Wilson, et aL, Cell 37:767 
(1984)). 

Thus, any of the above fusion proteins can be engineered using the IL-21 and 
IL-22 polynucleotides or the polypeptides. 

Vectors. H ost Cells, and Protein Production 

The present invention also relates to vectors containing the IL-21 and IL-22 
polynucleotides, host cells, and the production of polypeptides by recombinant 
techniques. The vector may be, for example, a phage, plasmid, viral, or retroviral 
vector. Retroviral vectors may be replication competent or replication defective. In the 
latter case, viral propagation generally will occur only in complementing host cells. 

IL-21 and IL-22 polynucleotides may be joined to a vector containing a 
selectable marker for propagation in a host. Generally, a plasmid vector is introduced 
in a precipitate, such as a calcium phosphate precipitate, or in a complex with a charged 
lipid. If the vector is a virus, it may be packaged in vitro using an appropriate 
packaging cell line and then transduced into host cells. 

The IL-21 and IL-22 polynucleotide inserts should be operatively linked to an 
appropriate promoter, such as the phage lambda PL promoter, the E. coli lac, trp, phoA 
and tac promoters, the SV40 early and late promoters and promoters of retroviral LTRs, 
to name a few. Other suitable promoters will be known to the skilled artisan. The 
expression constructs will further contain sites for transcription initiation, termination, 
and, in the transcribed region, a ribosome binding site for translation. The coding 
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portion of the transcripts expressed by the constructs will preferably include a 
translation initiating codon at the beginning and a termination codon (UAA, UGA or 
UAG) appropriately positioned at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one 
selectable marker. Such markers include dihydrofolate reductase, G418 or neomycin 
resistance for eukaryotic cell culture and tetracycline, kanamycin or ampicillin resistance 
genes for culturing in E. colt and other bacteria. Representative examples of 
appropriate hosts include, but are not limited to, bacterial cells, such as E. coli, 
Streptomyces, and Salmonella typhimurium cells; fungal cells, such as yeast cells; 
insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, 
COS, 293, and Bowes melanoma cells; and plant cells. Appropriate culture mediums 
and conditions for the above-described host cells are known in the art. 

Among vectors preferred for use in bacteria include pHE4-5 and other pHE-like 
vectors; pQE70, pQE60 and pQE-9, available from QIAGEN, Inc.; pBluescript 
vectors, Phagescript vectors, pNH8A, pNH16a, pNH18A, pNH46A, available from 
Stratagene Cloning Systems, Inc.; and ptrc99a ? pKK223-3, pKK233-3, pDR540, 
pRlT5 available from Pharmacia Biotech, Inc. Among preferred eukaryotic vectors are 
pWLNEO, pSV2CAT, pOG44, pXTl and pSG available from Stratagene; and pSVK3, 
pBPV, pMSG and pS VL available from Pharmacia. Other suitable vectors will be 
readily apparent to the skilled artisan. 

Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated 
transfection, electroporation, transduction, infection, or other methods. Such methods 
are described in many standard laboratory manuals (for example, Davis, et al, Basic 
Methods In Molecular Biology (1986)). It Is specifically contemplated that IL-21 and 
IL-22 polypeptides may, in fact, be expressed by a host cell lacking a recombinant 
vector. 

IL-21 and BL-22 polypeptides can be recovered and purified from recombinant 
cell cultures by well-known methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography and lectin chromatography. Most 
preferably, high performance liquid chromatography ("HPLC") is employed for 
. purification. 

IL-21 and IL-22 polypeptides, and preferably the secreted forms thereof, can 
also be recovered from: products purified from natural sources, including bodily 
fluids, tissues and cells, whether directly isolated or cultured; products of chemical 
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synthetic procedures; and products produced by recombinant techniques from a 
prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher plant, 
insect, and mammalian cells. Depending upon the host employed in a recombinant 
production procedure, the IL-21 and 1L-22 polypeptides may be glycosylated or may be 
5 non-glycosylated. In addition, IL-21 and 1L-22 polypeptides may also include an initial 
modified methionine residue, in some cases as a result of host-mediated processes. 
Thus, it is well known in the art that the N-terminal methionine encoded by the 
translation initiation codon generally is removed with high efficiency from any protein 
after translation in all eukaryotic cells. While the N-terminal methionine on most 
1 0 proteins also is efficiently removed in most prokaryotes, for some proteins, this 

prokaryotic removal process is inefficient, depending on the nature of the amino acid to 
which the N-terminal methionine is covalently linked. 

d 

□ Uses of the lh-Zl flnd IL-2? Polynucleotides 

j^j 1 5 The IL-21 and IL-22 polynucleotides identified herein can be used in numerous 

y ways as reagents. The following description should be considered exemplary and 

£ utilizes known techniques. 

, There exists an ongoing need to identify new chromosome markers, since few 

Q chromosome marking reagents, based on actual sequence data (repeat polymorphisms), 

jljj 20 are presently available. Clone HTGED 19 and clone HFPBX96 can each be mapped to 
\Q a specific chromosome. Thus, IL-2 1 and IL-22 polynucleotides can then be used in 

^ linkage analysis as a marker for those specific chromosome. 

Briefly, sequences can be mapped to chromosomes by preparing PGR primers 
(preferably 15-25 bp) from the sequences shown in SEQ ID NO:l and SEQ ID NO:3. 
25 Primers can be selected using computer analysis so that primers do not span more than 
one predicted exon in the genomic DNA. These primers are then used for PCR 
screening of somatic cell hybrids containing individual human chromosomes. Only 
those hybrids containing the human IL-21 or IL-22 genes corresponding to SEQ ID 
NO: 1 or SEQ ID NO:3, respectively, will yield an amplified fragment. 
30 Similarly, somatic hybrids provide a rapid method of PCR mapping the 

polynucleotides to particular chromosomes. Three or more clones can be assigned per 
day using a single thermal cycler. Moreover, sublocalization of the IL-21 and IL-22 
polynucleotides can be achieved with panels of specific chromosome fragments. Other 
gene mapping strategies that can be used include in situ hybridization, prescreening 
35 with labeled flow-sorted chromosomes, and preselection by hybridization to construct 
chromosome specific-cDNA libraries. 
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Precise chromosomal location of the IL-21 and TL-22 polynucleotides can also 
be achieved using fluorescence in situ hybridization (FISH) of a metaphase 
chromosomal spread. This technique uses polynucleotides as short as 500 or 600 
bases; however, polynucleotides 2,000-4,000 bp are preferred (For review, see Verma, 
et aL> "Human Chromosomes: a Manual of Basic Techniques," Pergamon Press, New 
York (1988)). 

For chromosome mapping, the IL-21 and IL-22 polynucleotides can be used 
individually (to mark a single chromosome or a single site on that chromosome) or in 
panels (for marking multiple sites and/or multiple chromosomes). Preferred 
polynucleotides correspond to the noncoding regions of the cDNAs because the coding 
sequences are more likely conserved within gene families, thus increasing the chance of 
cross hybridization during chromosomal mapping. 

Once a polynucleotide has been mapped to a precise chromosomal location, the 
physical position of the polynucleotide can be used in linkage analysis. Linkage 
analysis establishes coinheritance between a chromosomal location and presentation of a 
particular disease (disease mapping data are found, for example, in McKusick, V., 
Mendelian Inheritance in Man (available on line through Johns Hopkins University 
Welch Medical Library)). Assuming 1 megabase mapping resolution and one gene per 
20 kb, a cDNA precisely localized to a chromosomal region associated with the disease 
could be one of 50-500 potential causative genes. 

Thus, once coinheritance is established, differences in the IL-21 and IL-22 
polynucleotides and the corresponding genes between affected and unaffected 
individuals can be examined. First, visible structural alterations in the chromosomes, 
such as deletions or translocations, are examined in chromosome spreads or by PCR. 
If no structural alterations exist, the presence of point mutations are ascertained. 
Mutations observed in some or all affected individuals, but not in normal individuals, 
indicates that the mutation may cause the disease. However, complete sequencing of 
the IL-21 and IL-22 polypeptides and the corresponding genes from several normal 
individuals is required to distinguish the mutauon from a polymorphism. If a new 
polymorphism is identified, this polymorphic polypeptide can be used for further 
linkage analysis. 

Furthermore, increased or decreased expression of the gene in affected 
individuals as compared to unaffected individuals can be assessed using IL-21 and 
IL-22 polynucleotides. Any of these alterations (altered expression, chromosomal 
rearrangement, or mutauon) can be used as a diagnostic or prognostic marker. 

In addition to the foregoing, an IL-21 or EL-22 polynucleotide can be used to 
control gene expression through triple helix formation or antisense DNA or RNA. Both 
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methods rely on binding of the polynucleotide to DNA or RNA. For these techniques, 
preferred polynucleotides are usually 20 to 40 bases in length and complementary to 
either the region of the gene involved in transcription (triple helix - see Lee, et al, NucL 
Acids Res. 6:3073 (1979); Cooney, et al, Science 241:456 (1988); and Dervan, et 
ai, Science 251:1360 (1991)) or to the mRNA itself (antisense - Okano, J. 
Neurochem. 56:560 (1991); Oligodeoxy-nucleotides as Antisense Inhibitors of Gene 
Expression, CRC Press, Boca Raton, FL (1988)). Triple helix formation optimally 
results in a shut-off of RNA transcription from DNA, while antisense RNA 
hybridization blocks translation of an mRNA molecule into polypeptide. Both 
techniques are effective in model systems, and the information disclosed herein can be 
used to design antisense or triple helix polynucleotides in an effort to treat disease. 

IL-21 and IL-22 polynucleotides are also useful in gene therapy. One goal of 
gene therapy is to insert a normal gene into an organism having a defective gene, in an 
effort to correct the genetic defect. IL-21 and IL-22 offer means of targeting such 
genetic defects in a highly accurate manner. Another goal is to insert a new gene that 
was not present in the host genome, thereby producing a new trait in the host cell. 

The IL-21 and IL-22 polynucleotides are also useful for identifying individuals 
from minute biological samples. The United States military, for example, is 
considering the use of restriction fragment length polymorphism (RFLP) for 
identification of its personnel. In this technique, an individual's genomic DNA is 
digested with one or more restriction enzymes, and probed on a Southern blot to yield 
unique bands for identifying personnel. This method does not suffer from the current 
limitations of "Dog Tags" which can be lost, switched, or stolen, making positive 
identification difficult. The IL-21 and IL-22 polynucleotides can be used as additional 
DNA markers for RFLP. 

The IL-21and IL-22 polynucleotides can also be used as an alternative to 
RFLP, by determining the actual base-by-base DNA sequence of selected portions of an 
individual's genome. These sequences can be used to prepare PCR primers for 
amplifying and isolating such selected DNA, which can then be sequenced. Using this 
technique, individuals can be identified because each individual will have a unique set 
of DNA sequences. Once an unique ID database is established for an individual, 
positive identification of that individual, living or dead, can be made from extremely 
small tissue samples. 

Forensic biology also benefits from using DNA-based identification techniques 
as disclosed herein. DNA sequences taken from very small biological samples such as 
tissues, e.g., hair or skin, or body fluids, e.g., blood, saliva, semen, etc., can be 
amplified using PCR. In one prior art technique, gene sequences amplified from 
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polymorphic loci, such as DQa class II HLA gene, are used in forensic biology to 
identify individuals (Erlich, H., PCR Technology, Freeman and Co. (1992)). Once 
these specific polymorphic loci are amplified, they are digested with one or more 
restriction enzymes, yielding an identifying set of bands on a Southern blot probed with 
DNA corresponding to the DQa class II HLA gene. Similarly, IL-21 and IL-22 
polynucleotides can be used as polymorphic markers for forensic purposes. 

There is also a need for reagents capable of identifying the source of a particular 
tissue. Such need arises, for example, in forensics when presented with tissue of 
unknown origin. Appropriate reagents can comprise, for example, DNA probes or 
primers specific to particular tissue prepared from IL-21 and IL-22 sequences. Panels 
of such reagents can identify tissue by species and/or by organ type. In a similar 
fashion, these reagents can be used to screen tissue cultures for contamination. 

Because EL-21 is found expressed almost exclusively in apoptotic T-cells, IL-21 
polynucleotides are useful as hybridization probes for differential identification of the 
tissue(s) or cell type(s) present in a biological sample. Similarly, polypeptides and 
antibodies directed to IL-21 polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). In addition, for a number of 
disorders of the above tissues or cells, particularly of the Immune system, significantly 
higher or lower levels of IL-21 gene expression may be detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e^g., serum, plasma, urine, synovial 
fluid or spinal fluid) taken from an individual having such a disorder, relative to a 
"standard" IL-21 gene expression level, i.e., the IL-21 expression level in healthy 
tissue from an individual not having the Immune system disorder. 

Likewise, since IL-22 is found expressed in bone marrow, skeletal muscle, and 
brain, IL-22 polynucleotides are useful as hybridization probes for differential 
identification of the tissue(s) or cell type(s) present in a biological sample. Similarly, 
polypeptides and antibodies directed to IL-22 polypeptides are useful to provide 
immunological probes for differential identification of the tissue(s) or cell type(s). In 
addition, for a number of disorders of the above tissues or ceils, particularly of the 
Immune system, significantly higher or lower levels of IL-22 gene expression may be 
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual having 
such a disorder, relative to a "standard" IL-22 gene expression level, i.e., the IL-22 ■ 
expression level in healthy tissue from an individual not having the Immune system 
disorder. 

Thus, the invention provides a diagnostic method of a disorder, which involves: 
(a) assaying IL-21 or IL-22 gene expression level in cells or body fluid of an individual; 
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(b) comparing the IL-21 or IL-22 gene expression level with a standard IL-21 or IL-22 
gene expression level, respectively, whereby an increase or decrease in the assayed 
IL-21 or IL-22 gene expression level compared to the standard expression level is 
indicative of disorder in the Immune system. 

In the very least, the IL-21 and IL-22 polynucleotides can be used as molecular 
weight markers on Southern gels, as diagnostic probes for the presence of a specific 
mRNA in a particular cell type, as a probe to "subtract-out" known sequences in the 
process of discovering novel polynucleotides, for selecting and making oligomers for 
attachment to a "gene chip" or other support, to raise anti-DNA antibodies using DNA 
immunization techniques, and as an antigen to elicit an immune response. 

Uses of IL-21 and IL-22 Polypeptides 

IL-21 and IL-22 polypeptides can be used in numerous ways. The following 
description should be considered exemplary and utilizes known techniques. 

IL-21 and IL-22 polypeptides can be used to assay protein levels in a biological 
sample using antibody-based techniques. For example, protein expression in tissues 
can be studied with classical immunohistological methods (Jalkanen, M., et al, J. Cell 
Biol 101:976-985 (1985); Jalkanen, M., et al, J. Cell Biol 105:3087-3096 
(1987)). Other antibody-based methods useful for detecting protein gene expression 
include immunoassays, such as the enzyme linked immunosorbent assay (ELIS A) and 
the radioimmunoassay (RIA). Suitable antibody assay labels are known in the art and 
include enzyme labels, such as, glucose oxidase, and radioisotopes, such as iodine 
( ,25 I, m l), carbon ( 14 C), sulfur ( 33 S), tritium ( 3 H), indium ( ,l2 In), and technetium 
(""Tc), and fluorescent labels, such as fluorescein and rhodamine, and biotin. 

In addition to assaying secreted protein levels in a biological sample, proteins 
can also be detected in vivo by imaging. Antibody labels or markers for in vivo 
imaging of protein include those detectable by X-radiography , NMR or ESR. For 
X-radiography, suitable labels include radioisotopes such as barium or cesium, which 
emit detectable radiation but are not overtly harmful to the subject. Suitable markers for 
NMR and ESR include those with a detectable characteristic spin, such as deuterium, 
which may be incorporated into the antibody by labeling of nutrients for the relevant 
hybridoma. 

A protein-specific antibody or antibody fragment which has been labeled with 
an appropriate detectable imaging moiety, such as a radioisotope (for example, 13, T, 
n2 In, Wm Tc), a radio-opaque substance, or a material detectable by nuclear magnetic 
resonance, is introduced (for example, parenterally, subcutaneously, or 
intraperitoneally) into the mammal. It will be understood in the art that the size of the 
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subject and the imaging system used will determine the quantity of imaging moiety 
needed to produce diagnostic images. In ihe case of a radioisotope moiety, for a human 
subject, the quantity of radioactivity injected will normally range from about 5 to 20 
millicuries of ^^c. The labeled antibody or antibody fragment will then preferentially 
accumulate at the location of cells which contain the specific protein. In vivo tumor 
imaging is described by Burchiel and colleagues ("Immunopharmacolcinetics of 
Radiolabeled Antibodies and Their Fragments." (Chapter 13 in Tumor Imaging: The 
Radiochemical Detection of Cancer, S.W. Burchiel and B. A; Rhodes, eds,, Masson 
Publishing Inc. (1982)). 

Thus, the invention provides a diagnostic method of a disorder, which involves 
(a) assaying the expression of IL-21 or IL-22 polypeptides in cells or body fluid of an 
individual; (b) comparing the level of IL-21 or IL-22 gene expression with a standard 
gene expression level, whereby an increase or decrease in the assayed IL-21 or IL-22 
polypeptide gene expression level compared to the standard expression level is 
indicative of a disorder. 

Moreover, IL-21 and IL-22 polypeptides can be used to treat disease. For 
example, patients can be administered IL-21 and IL-22 polypeptides in an effort to 
replace absent or decreased levels of the IL-21 and IL-22 polypeptides, respectively, 
(e.g., insulin), to supplement absent or decreased levels of a different polypeptide 
(e.g., hemoglobin S for hemoglobin B), to inhibit the activity of a polypeptide (e.g., an 
oncogene), to activate the activity of a polypeptide (e.g., by binding to a receptor), to 
reduce the activity of a membrane bound receptor by competing with it for free ligand 
(e.g., soluble TNF receptors used in reducing inflammation), or to bring about a 
desired response (e.g., blood vessel growth). 

Similarly, antibodies directed to IL-21 and IL-22 polypeptides can also be used 
to treat disease. For example, administration of an antibody directed to an IL-21 or 
IL-22 polypeptide can bind and reduce overproduction of the polypeptide. Similarly, 
administration of an antibody can activate the polypeptide, such as by binding to a 
polypeptide bound to a membrane (receptor). 

At the very least, the IL-21 and IL-22 polypeptides can be used as molecular 
weight markers on SDS-PAGE gels or on molecular sieve gel filtration columns using 
methods well known to those of skill in the art. IL-21 and IL-22 polypeptides can also 
be used to raise antibodies, which, in turn, are used to measure protein expression from 
a recombinant cell, as a way of assessing transformation of the host cell. Moreover, 
IL-21 and IL-22 polypeptides can be used to test the following biological activities.. 
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Biological Activities of IL-21 and IL-22 

TL-21 and IL-22 polynucleotides and polypeptides can be used in assays to test 
for one or more biological activities. If IL-21 and IL-22 polynucleotides and 
polypeptides do exhibit activity in a particular assay, it is likely that IL-21 and IL-22 
may be involved in the diseases associated with the biological activity. Therefore, 
IL-21 and TL-22 could be used to treat the associated disease. 

The IL-21 and IL-22 proteins of the present invention modulate fL-6 secretion 
from NIH-3T3 cells. An in vitro ELTS A assay which quantitates the amount of IL-6 
secreted from cells in response to treatment with cytokines or the soluble extracellular 
domains of cytokine receptors has been described (Yao, Z., et al., Immunity 3:81 1-82 1 
(1995)). Briefly, the assay involves plating the target cells at a density of 
approximately 5 x 10 6 cells/mL in a volume of 500 u\L in the wells of a 24 well 
flat-bottomed culture plate (Costar). The cultures are then treated with various 
concentrations of the cytokine or the soluble extracellular domain of cytokine receptor in 
question. The cells are then cultured for 24 hours at 37*C. At this time, 50 pL of 
supernatant is removed and assayed for the quantity of IL-6 essentially as described by 
the manufacturer (Genzyme, Boston, MA). IL-6 levels are then calculated by reference 
to a standard curve constructed with recombinant IL-17 cytokine. Such activity is 
useful for determining the level of IL-21- or IL-22-mediated IL-6 secretion. 

IL-21 and IL-22 protein modulates immune system cell proliferation and 
differentiation in a dose-dependent manner in the above-described assay. Thus, "a 
polypeptide having IL-21 or IL-22 protein activity" includes polypeptides that also 
exhibit any of the same stimulatory activities in the above-described assays in a 
dose-dependent manner. Although the degree of dose-dependent activity need not he 
identical to that of the IL-21 or IL-22 proteins, preferably, "a polypeptide having IL-21 
or IL-22 protein activity" will exhibit substantially similar dose-dependence in a given 
activity as compared to the IL-21 or IL-22 protein (i.e., the candidate polypeptide will 
exhibit greater activity or not more than about 25-fold less and, preferably, not more 
than about tenfold less activity relative to the reference IL-21 or IL-22 protein). 

Lymphocyte proliferation is another in vitro assay which may be performed to 
determine the activity of TL-21 and IL-22. For example, Yao and colleagues {Immunity 
3:81 1-821 (1995)) have recently described an in vitro assay for determining the effects 
of various cytokines and soluble cytokine receptors on the proliferation of murine 
leukocytes. Briefly, lymphoid organs are harvested aseptically, lymphocytes are 
isolated from the harvested organs, and the resulting collection of lymphoid cells are 
suspended in standard culture medium as described by Fanslow and coworkers (7. 
Immunol. 147:535-5540 (1991)). The lymphoid cell suspensions may then be divided 
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into several different subclasses of lymphoid cells including splenic T-cells, lymph 
node B -cells, CD4' and CD8 + T-cells, and mature adult thymocytes. For splenic 
T-celis, spleen cell suspensions (200 x 10 6 cells) are incubated with CD lib mAb and 
class TI MHC mAb for 30 min at 4*C, loaded on aT-cell purification column (Pierce, 
5 Rockford, 1L), and the T-celis eluted according to the manufacturer's instructions. 

Using this method, purity of the resulting T-cell populations should be >95% CD3 + and 
<\% sIgM + . For purification of lymph node subsets, B-cells are removed from by 
adherence to tissue culture dishes previously coated with goat anti-mouse IgG 
(lOu^g/mL). Remaining cells were then incubated with anti-CD4 or anti-CD8 for 30 

10 min at 4°C then washed and placed on tissue culture dishes previously coated with goat 
anti-rat IgG (20 u.g/mL). After 45 min, nonadherent cells are removed and tested for 
purity by flow cytometry. CD4 and surface Ig-depleted cells should be >90% TCR-ab, 
CD8*, whereas CD8 and surface Ig-depleted cells should be >95% TCR-ab, CD4\ 
Finally, to enrich for mature adult thymocytes, cells are suspended at lOVmL in 10% 

15 anti-HSA and 10% low tox rabbit complement (Ccdarlane, Ontario, Canada), incubated 
for 45 min at 37 °C, and remaining viable cells isolated over Ficoll-Hypaque 
(Pharmacia, Piscataway, NJ). This procedure should yield between 90 and 95% CT33 hi 
cells that are either CD4 + 8- or CD4'8 + . 

20 Immune Activity 

IL-21 and EL-22 polypeptides or polynucleotides may be useful in treating 
deficiencies or disorders of the immune system, by activating or inhibiting the 
proliferation, differentiation, or mobilization (chemotaxis) of immune cells. Immune 
cells develop through a process called hematopoiesis, producing myeloid (platelets, red 

25 blood cells, neutrophils, and macrophages) and lymphoid (B and T lymphocytes) cells 
from pluripotent stem cells. The etiology of these immune deficiencies or disorders 
may be genetic, somatic, such as cancer or some autoimmune disorders, acquired (e.g., 
by chemotherapy or toxins), or infectious. Moreover, IL-21 and TL-22 polynucleotides 
or polypeptides can be used as a marker or detector of a particular immune system 

30 disease or disorder. 

IL-21 and EL-22 polynucleotides or polypeptides may be useful in treating or 
detecting deficiencies or disorders of hematopoietic cells. EL-21 and IL-22 polypeptides 
or polynucleotides could be used to increase differentiation and proliferation of 
hematopoietic cells, including the pluripotent stem cells, in an effort to treat those 

35 disorders associated with a decrease in certain (or many) types hematopoietic ceils. 

Examples of immunologic deficiency syndromes include, but are not limited to: blood 
protein disorders (e.g. agammaglobulinemia, dysgammaglobulinemia), ataxia 
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telangiectasia, common variable immunodeficiency, Digeorge Syndrome, HTV 
infection, HTLV-BLV infection, leukocyte adhesion deficiency syndrome, 
lymphopenia, phagocyte bactericidal dysfunction, severe combined immunodeficiency 
(SCIDs), Wiskott-Aldrich Disorder, anemia, thrombocytopenia, or hemoglobinuria. 

Moreover, IL-21 and IL-22 polypeptides or polynucleotides can also be used to 
modulate hemostatic (the stopping of bleeding) or thrombolytic activity (clot formation). 
For example, by increasing hemostatic or thrombolytic activity, TL-21 and IL-22 
polynucleotides or polypeptides could be used to treat blood coagulation disorders 
(e.g., afibrinogenemia, factor deficiencies), blood platelet disorders (e.g. 
thrombocytopenia), or wounds resulting from trauma, surgery, or other causes. 
Alternatively, IL-21 and IL-22 polynucleotides or polypeptides that can decrease 
hemostatic or thrombolytic activity could be used to inhibit or dissolve clotting, 
important in the treatment of heart attacks (infarction), strokes, or scarring. 

IL-21 and IL-22 polynucleotides or polypeptides may also be useful in treating 
or detecting autoimmune disorders. Many autoimmune disorders result from 
inappropriate recognition of self as foreign material by immune cells. This 
inappropriate recognition results in an immune response leading 1 to the destruction of the 
host tissue. Therefore, the administration of IL-21 and IL-22 polypeptides or 
polynucleotides that can inhibit an immune response, particularly the proliferation, 
differentiation, or chemotaxis of T-cells, may be an effective therapy in preventing 
autoimmune disorders. 

Examples of autoimmune disorders that can be treated or detected by IL-21 and 
IL-22 include, but are not limited to: Addison's Disease, hemolytic anemia, 
antiphospholipid syndrome, rheumatoid arthritis, dermatitis, allergic encephalomyelitis, 
glomerulonephritis, Goodpasture's Syndrome, Graves' Disease, Multiple Sclerosis, 
Myasthenia Gravis, Neuritis, Ophthalmia, Bullous Pemphigoid, Pemphigus, 
Polyendocrinopathies, Purpura, Retter's Disease, Stiff-Man Syndrome, Autoimmune 
Thyroiditis, Systemic Lupus Erythematosus, Autoimmune Pulmonary Inflammation, 
Guillain-Barre Syndrome, insulin dependent diabetes meltitis, and autoimmune 
inflammatory eye disease. 

Similarly, allergic reactions and conditions, such as asthma (particularly allergic 
asthma) or other respiratory problems, may also be treated by IL-21 and IL-22 
polypeptides or polynucleotides. Moreover, IL-21 and IL-22 can be used to treat 
anaphylaxis, hypersensitivity to an antigenic molecule, or blood group incompatibility. 

IL-21 and IL-22 polynucleotides or polypeptides may also be used to treat 
and/or prevent organ rejection or graft-vcrsus-host disease (GVHD). Organ rejection 
occurs by host immune cell destruction of the transplanted tissue through an immune 
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response. Similarly, an immune response is also involved in GVHD, but, in this case, 
the foreign transplanted immune cells destroy the host tissues. The administration of 
IL-21 and IL-22 polypeptides or polynucleotides that inhibits an immune response, 
particularly the proliferation, differentiation, or chemotaxis of T-cells, may be an 
effective therapy in preventing organ rejection or GVHD. 

Similarly, IL-21 and IL-22 polypeptides or polynucleotides may also be used to 
modulate inflammation. For example, IL-21 and IL-22 polypeptides or polynucleotides 
may inhibit the proliferation and differentiation of cells involved in an inflammatory 
response. These molecules can be used to treat inflammatory conditions, both chronic 
and acute conditions, including inflammation associated with infection (e.g., septic 
shock, sepsis, or systemic inflammatory response syndrome (SIRS)), 
ischemia-reperfusion injury, endotoxin lethality, arthritis, complement-mediated 
hyperacute rejection, nephritis, cytokine or chemokine induced lung injury, 
inflammatory bowel disease, Crohn's disease, or resulting from over production of 
cytokines (e.g., TNF or IL-1.) 

Hyperproliferative Disorders 

IL-21 and IL-22 polypeptides or polynucleotides can be used to treat or detect 
hyperproliferative disorders, including neoplasms. IL-21 and EL-22 polypeptides or 
polynucleotides may inhibit the proliferation of the disorder through direct or indirect 
interactions. Alternatively, IL-21 and IL-22 polypeptides or polynucleotides may 
proliferate other cells which can inhibit the hyperproliferative disorder. 

For example, by increasing an immune response, particularly increasing 
antigenic qualities of the hyperproliferative disorder or by proliferating, differentiating, 
or mobilizing T-cells, hyperproliferative disorders can be treated. This immune 
response may be increased by either enhancing an existing immune response, or by 
initiating a new immune response. Alternatively, decreasing an immune response may 
also be a method of treating hyperproliferative disorders, such as a chemotherapeutic 
agent. 

Examples of hyperproliferative disorders that can be treated or detected by IL-21 
and IL-22 polynucleotides or polypeptides include, but are not limited to neoplasms 
located in the: abdomen, bone, breast, digestive system, liver, pancreas, peritoneum, 
endocrine glands (adrenal, parathyroid, pituitary, testicles, ovary, thymus, thyroid), 
eye, head and neck, nervous (central and peripheral), lymphatic system, pelvic, skin, 
soft tissue, spleen, thoracic, and urogenital. 

Similarly, other hyperproliferative disorders can also be treated or detected by 
IL-21 and IL-22 polynucleotides or polypeptides. Examples of such hyperproliferative 
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disorders include, but are not limited to: hypergammaglobulinemia, 
lymphoproliferative disorders, paraproteinemias, purpura, sarcoidosis, Sezary 
Syndrome, Waldenstron's Macroglobulinemia, Gaucher's Disease, histiocytosis, and 
any other hyperproliferative disease, besides neoplasia, located in an organ system 
listed above. 

In fectious Disease 

IL-21 and 1L-22 polypeptides or polynucleotides can be used to treat or detect 
infectious agents. For example, by increasing the immune response, particularly 
increasing the proliferation and differentiation of B and/or T cells, infectious diseases 
may be treated. The immune response may be increased by either enhancing an existing 
immune response, or by initiating a new immune response. Alternatively, IL-21 and 
IL-22 polypeptides or polynucleotides may also directly inhibit the infectious agent, 
without necessarily eliciting an immune response. 

Viruses are one example of an infectious agent that can cause disease or 
symptoms that can be treated or detected by IL-21 and IL-22 polynucleotides or 
polypeptides. Examples of viruses, include, but are not limited to the following DNA 
and RNA viral families: Arbovirus, Adenoviridae, Arenaviridae, Arterivirus, 
Birnaviridae, Bunyaviridae, Caltciviridae, Circoviridae, Coronaviridae, Flaviviridae, 
Hepadnavihdae (Hepatitis), Herpesviridae (such as, Cytomegalovirus, Herpes 
Simplex, Herpes Zoster), Mononegavirus (e.g., Paramyxoviridae, Morbillivirus, 
Rhabdoviridae), Orthomyxoviridae (e.g., Influenza), Papovaviridae, Parvoviridae, 
Picornaviridae, Poxviridae (such as Smallpox or Vaccinia), Reoviridae (e.g., 
Rotavirus), Retroviridae (HTLV-T, HTLV-fl, Lentivirus), and Togaviridae (e.g., 
Rubivirus). Viruses falling within these families can cause a variety of diseases or 
symptoms, including, but not limited to: arthritis, bronchiol litis, encephalitis, eye 
infections (e.g., conjunctivitis, keratitis), chronic fatigue syndrome, hepatitis (A, B, C, 
E, Chronic Active, Delta), meningitis, opportunistic infections (e.g., AIDS), 
pneumonia, Burton's Lymphoma, chickenpox , hemorrhagic fever, Measles, Mumps, 
Parainfluenza, Rabies, the common cold, Polio, leukemia, Rubella, sexually 
transmitted diseases, skin diseases (e.g., Kaposi's, warts), and viremia. IL-21 and 
IL-22 polypeptides or polynucleotides can be used to treat or detect any of these 
symptoms or diseases. 

Similarly, bacterial or fungal agents that can cause disease or symptoms and that 
can be treated or detected by IL-21 and IL-22 polynucleotides or polypeptides include, 
but not limited to, the following Gram- Negative and Gram-positive bacterial families 
and fungi: Actinomycctales (e.g., Corynebacterium, Mycobacterium, Norcardia), 
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Aspergillosis, Bacillaceae (e.g., Anthrax, Clostridium), Bacteroidaceae, 
Blastomycosis, Bordetella, Borrelia, Brucellosis, Candidiasis, Campylobacter, 
Coccidioidomycosis, Cryptococcosis, Dermatocycoses, Enterobacteriaceae (KJebsiella, 
Salmonella, Serratia, Yersinia), Erysipelothrix, Helicobacter, Legionellosis, 
Leptospirosis, Listeria, Mycoplasmatales, Neisseriaceac (e.g., Acinetobacter, 
Gonorrhea, Menigococcal), Pasteurellacea Infections (e.g., Actinobacillus, 
Heamophilus, Pasteurella), Pseudomonas, Rickertsiaceae, Chlamydiaceae, Syphilis, 
and Staphylococcal. These bacterial or fungal families can cause the following diseases 
or symptoms, including, but not limited to: bacteremia, endocarditis, eye infections 
(conjunctivitis, tuberculosis, uveitis), gingivitis, opportunistic infections (e.g., AIDS 
related infections), paronychia, prosthesis- related infections, Reiter's Disease, 
respiratory tract infections, such as Whooping Cough or Empyema, sepsis, Lyme 
Disease, Cat-Scratch Disease, Dysentery, Paratyphoid Fever, food poisoning. 
Typhoid, pneumonia, Gonorrhea, meningitis, Chlamydia, Syphilis, Diphtheria, 
Leprosy, Paratuberculosis, Tuberculosis, Lupus, Botulism, gangrene, tetanus, 
impetigo, Rheumatic Fever, Scarlet Fever, sexually transmitted diseases, skin diseases 
(e.g., cellulitis, dermatocycoses), toxemia, urinary tract infections, wound infections. 
rL-21 and IL-22 polypeptides or polynucleotides can be used to treat or detect any of 
these symptoms or diseases. 

Moreover, parasitic agents causing disease or symptoms that can be treated or 
delected by IL-21 polynucleotides or polypeptides include, but not limited to, the 
following families: Amebiasis, Babesiosis, Coccidiosis, Cryptosporidiosis, 
Dientamoebiasis, Dourine, Ectoparasitic, Giardiasis, Helminthiasis, Leishmaniasis, 
Theileriasis, Toxoplasmosis, Trypanosomiasis, and Trichomonas. These parasites can 
cause a variety of diseases or symptoms, including, but not limited to: Scabies, 
Trombiculiasis, eye infections, intestinal disease (e.g., dysentery, giardiasis), liver 
disease, lung disease, opportunistic infections (e.g., AIDS related), Malaria, pregnancy 
complications, and toxoplasmosis. IL-21 and IL-22 polypeptides or polynucleotides 
can be used to treat or detect any of these symptoms or diseases. 

Preferably, treatment using IL-21 and IL-22 polypeptides or polynucleotides 
could either be by administering an effective amount of IL-21 or TL-22 polypeptide to 
the patient, or by removing cells from the patient, supplying the cells with TL-21 and 
IL-22 polynucleotide, and returning the engineered cells to the patient (ex vivo therapy). 
Moreover, the IL-21 and IL-22 polypeptide or polynucleotide can be used as an antigen 
in a vaccine to raise an immune response against infectious disease. 
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Regeneration 

IL-21 and IL-22 polynucleotides or polypeptides can be used to differentiate, 
proliferate, and attract cells, leading to the regeneration of tissues (see, Science 
276:59-87 (1997)). The regeneration of tissues could be used to repair, replace, or 
protect tissue damaged by congenital defects, trauma (wounds, burns, incisions, or 
ulcers), age, disease (e.g. osteoporosis, osteocarthritis, periodontal disease, liver 
failure), surgery, including cosmetic plastic surgery, fibrosis, reperfusion injury, or 
systemic cytokine damage. 

Tissues that could be regenerated using the present invention include organs 
(e.g., pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal 
or cardiac), vascular (including vascular endothelium), nervous, hematopoietic, and 
skeletal (bone, cartilage, tendon, and ligament) tissue. Preferably, regeneration occurs 
without or decreased scarring. Regeneration also may include angiogenesis. 

Moreover, IL-21 and IL-22 polynucleotides or polypeptides may increase 
regeneration of tissues difficult to heal. For example, increased tendon/ligament 
regeneration would quicken recovery time after damage. IL : 21 and IL-22 
polynucleotides or polypeptides of the present invention could also be used 
prophylactically in an effort to avoid damage. Specific diseases that could be treated 
include of tendinitis, carpal tunnel syndrome, and other tendon or ligament defects. , A 
further example of tissue regeneration of non-healing wounds includes pressure ulcers, 
ulcers associated with vascular insufficiency, surgical, and traumatic wounds. 

Similarly, nerve and brain tissue could also be regenerated by using IL-21 and 
IL-22 polynucleotides or polypeptides to proliferate and differentiate nerve cells. 
Diseases that could be treated using this method include central and peripheral nervous 
system diseases, neuropathies, or mechanical and traumatic disorders (e.g., spinal cord 
disorders, head trauma, cerebrovascular disease, and stoke). Specifically, diseases 
associated with peripheral nerve injuries, peripheral neuropathy (e.g., resulting from 
chemotherapy or other medical therapies), localized neuropathies, and central nervous 
system diseases (e.g., Alzheimer's disease, Parkinson's disease, Huntington's disease, 
amyotrophic lateral sclerosis, and Shy-Drager syndrome), could all be treated using the 
IL-21 and IL-22 polynucleotides or polypeptides. 

Chemotaxis 

IL-21 and IL-22 polynucleotides or polypeptides may have chemotaxis activity. 
A chemotaxic molecule attracts or mobilizes cells (e.g., monocytes, fibroblasts, 
neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial cells) to a 
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particular site in the body, such as inflammation, infection, or site of hyperproliferation. 
The mobilized cells can then fight off and/or heal the particular trauma or abnormality. 

IL-2I and EL-22 polynucleotides or polypeptides may increase chemotaxic 
activity of particular cells. These chemotactic molecules can then be used to treat 
5 inflammation, infection, hyperproliferative disorders, or any immune system disorder 
by increasing the number of cells targeted to a particular location in the body. For 
example, chemotaxic molecules can be used to treat wounds and other trauma to tissues 
by attracting immune cells to the injured location. As a chemotactic molecule , IL-21 
and IL-22 could also attract fibroblasts, which can be used to treat wounds. 
10 It is also contemplated that IL-21 and TL-22 polynucleotides or polypeptides 

may inhibit chemotactic activity. These molecules could also be used to treat disorders. 
Thus, IL-21 and IL-22 polynucleotides or polypeptides could be used as an inhibitor of 
chemotaxis. 

15 Binding Activity 

IL-21 and IL-22 polypeptides may be used to screen for molecules that bind to 
IL-21 or IL-22 or for molecules to which IL-21 or IL-22 bind. The binding of IL-21 
and IL-22 and the molecule may activate (agonist), increase, inhibit (antagonist), or 
decrease activity of the IL-21 and IL-22 or the molecule bound. Examples of such 
20 molecules include antibodies, oligonucleotides, proteins (e.g., receptors),or small 
molecules. 

Preferably, the molecule is closely related to the natural ligand of IL-21 or 
IL-22, e.g., a fragment of the ligand, or a natural substrate, a ligand, a structural or 
functional mimetic (see, Coligan, et aL y Current Protocols in Immunology l(2):Chapter 

25 5 (1991)). Similarly, the molecule can be closely related to the natural receptor to 
which IL-21 and IL-22 bind, or at least, a fragment of the receptor capable of being 
bound by IL-21 or IL-22 (e.g., active site). In either case, the molecule can be 
rationally designed using known techniques. 

Preferably, the screening for these molecules involves producing appropriate 

30 cells which express IL-21 and TL-22, either as a secreted protein or on the cell 

membrane. Preferred cells include cells from mammals, yeast, Drosophila, or E. colL 
Cells expressing TL-2 1 and IL-22 (or cell membrane containing the expressed 
polypeptide) are then preferably contacted with a test compound potentially containing 
the molecule to observe binding, stimulation, or inhibition of activity of either IL-21 

35 and IL-22 or the molecule. 

The assay may simply test binding of a candidate compound to IL-21 or IL-22, 
wherein binding is detected by a label, or in an assay involving competition with a 
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labeled competitor. Further, the assay may test whether the candidate compound results 
in a signal generated by binding to 1L-21 or IL-22. 

Alternatively, the assay can be carried out using cell-free preparations, 
polypeptide/molecule affixed 10 a solid support, chemical libraries, or natural product 
5 mixtures. The assay may also simply comprise the steps of mixing a candidate 
compound with a solution containing IL-21 or IL-22, measuring IL-21/molecule or 
lL-22/molecule activity or binding, respectively, and comparing the IL-21 /molecule or 
lL-22/molecule activity or binding to a standard. 

Preferably, an ELISA assay can measure IL-21 and IL-22 levels or activities in 
10 a sample (e.g., biological sample) using a monoclonal or polyclonal antibody. The 
anubody can measure IL-21 and IL-22 levels or activities by either binding, directly or 
indirectly, to IL-21 or IL-22 or by competing with IL-21 or IL-22 for a substrate, 
p All of these above assays can be used as diagnostic or prognostic markers. The 

D molecules discovered using these assays can be used to treat disease or to bring about a 

15 particular result in a patient (e.g., blood vessel growth) by activating or inhibiting IL-21 
y or IL-22. Moreover, the assays can discover agents which may inhibit or enhance the 

4« production of IL-2 1 and IL-22 from suitably manipulated cells or tissues. 

Therefore, the invention includes a method of identifying compounds which 
□ bind to IL-21 and IL-22 comprising the steps of: (a) incubating a candidate binding 

|!J 20 compound with IL-21 or TL-22; and (b) determining if binding has occurred. 

u5 Moreover, the invention includes a method of identifying agonists/antagonists 

*3 comprising the steps of: (a) incubating a candidate compound with IL-21 or EL-22, (b) 

assaying a biological activity , and (b) determining if a biological activity of IL-21 or 



TL-22, respectively, has been altered. 
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Other Activities 

IL-21 and IL-22 polypeptides or polynucleotides may also increase or decrease 
the differentiation or proliferation of embryonic stem cells, besides, as discussed above, 
hematopoietic lineage. 

30 IL-2 1 and IL-22 polypeptides or polynucleotides may also be used to modulate 

mammalian characteristics, such as body height, weight, hair color, eye color, skin, 
percentage of adipose tissue, pigmentation, size, and shape (e.g., cosmetic surgery). 
Similarly, IL-21 and IL-22 polypeptides or polynucleotides may be used to modulate 
mammahan metabolism affecting catabolism, anabolism, processing, utilization, and 

35 storage of energy. 

IL-21 and IL-22 polypeptides or polynucleotides may be used to change a 
mammal's mental state or physical state by influencing biorhythms, caricadic rhythms, 
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depression (including depressive disorders), tendency for violence, tolerance for pain, 
reproductive capabilities (preferably by Activtn or Inhibin-like activity), hormonal or 
endocrine levels, appetite, libido, memory, stress, or other cognitive qualities. 

IL-21 and IL-22 polypeptides or polynucleotides may also be used as a food 
additive or preservative, such as to increase or decrease storage capabilities, fat content, 
lipid, protein, carbohydrate, vitamins, minerals, cofactors or other nutritional 
components. 

Having generally described the invention, the same will be more readily 
understood by reference to the following examples, which are provided by way of 
illustration and are not intended as limiting. 

Examples 

In the case where IL-22 is not specifically mentioned, specific details are 
provided in the following examples only for the IL-21 molecules of the present 
invention. However, the examples can also be easily performed for the IL-22 . 
molecules of the present invention by using the details provided for IL-21 and 
substituting appropriate nucleotides or amino acid residues of IL-22, for example, in the 
design of suitable PCR primers, and the like. The use or applicability of TL-22 in place 
of IL-21 is thus contemplated in each of the following examples. When provided with 
the nucleotide and amino acid sequences of IL-21 (SEQ ID NO: 1 and SEQ ID NO;2) 
and IL-22 (SEQ ED NO:3 and SEQ ID NO:4) of the present invention, one of ordinary 
skill in the art could easily perform the following examples with the intent of isolating 
or further characterizing or manipulating IL-22 in place of IL-2 1 . 

Example 1; Isolation of the IL-21 and IL-22 cDNA Clones From the 
Deposited Samples 

The cDNAs encoding IL-21 and IL-22 are each inserted into the Eco RI and 
Xho I restriction sites of the multiple cloning site of pBluescript. pBluescript contains 
an ampicillin resistance gene and may be transformed into E. coli strain DH10B, 
available from Life Technologies (see, for instance, Gruber, C. E., et al., Focus 15:59 
(1993)). 

Two approaches can be used to isolate IL-21 from the deposited sample. First,a 
specific polynucleotide of SEQ ID NO:l with 30-40 nucleotides is synthesized using an 
Applied Biosy stems DNA synthesizer according to the sequence reported. The 
oligonucleotide is labeled, for instance, with 32 P-y-ATP using T4 polynucleotide kinase 
and purified according to routine methods (e.g., Maniatis, etai y Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Press, Cold Spring, NY (1982)). The 
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plasmid mixture is transformed into a suitable host (such as XL-1 Blue (Stratagene)) 
using techniques known to those of skill in the art, such as those provided by the vector 
supplier or in related publications or patents. The transformants are plated on 1 .5% 
agar plates (containing the appropriate selection agent, e.g., ampicillin) to a density of 
about 150 transformants (colonies) per plate. These plates are screened using Nylon 
membranes according to routine methods for bacterial colony screening (e.g., 
Sambrook, et al, Molecular Cloning: A Laboratory Manual, 2nd Edit., (1989), Cold 
Spring Harbor Laboratory Press, pages 1 .93 to 1.104), or other techniques known to 
those of skill in the art. 

Alternatively, two primers of 17-20 nucleotides derived from both ends of the 
SEQ ID NO:l (i.e., within the region of SEQ ID NO:l bounded by the 5' and 3' 
nucleotides of the clone) are synthesized and used to amplify the IL-21 cDNA using the 
deposited cDNA plasmid as a template. The polymerase chain reaction is carried out 
under routine conditions, for instance, in 25 ^il of reaction mixture with 0.5 ug of the 
above cDNA template. A convenient reaction mixture is 1.5-5 mM MgCl 2 , 0.01% 
(w/v) gelatin, 20 uM each of dATP, dCTP, dGTP, dTTP, 25 pmol of each primer and 
0.25 Unit of Taq polymerase. Thirty five cycles of PCR (denaturation at 94°C for 1 
min; annealing at 55°C for 1 min; elongation at 72*C for 1 min) are performed with a 
Perkin-Elmer Cetus automated thermal cycler. The amplified product is analyzed by 
agarose gel electrophoresis and the DNA band with expected molecular weight is 
excised and purified. The PCR product is verified to be the selected sequence by 
subcloning and sequencing the DNA product. 

Several methods are available for the identification of the 5' or 3' non-coding 
portions of the 1L-2 1 gene which may not be present in the deposited clone. These 
methods include, but are not limited to, filter probing, clone enrichment using specific 
probes, and protocols similar or identical to 5' and 3* RACE protocols which are well 
known in the art. For instance, a method similar to 5* RACE is available for generating 
the missing 5* end of a desired full-length transcript (Fromont-Racine, et a!., Nucl. 
Acids Res. 21(7): 1683- 1684 (1993)). 

Briefly, a specific RNA oligonucleotide is ligated to the 5* ends of a population 
of RNA presumably containing full-length gene RNA transcripts. A primer set 
containing a primer specific to the ligated RNA oligonucleotide and a primer specific to 
a known sequence of the IL-21 gene of interest is used to PCR amplify the 5' portion of 
the IL-21 full-length gene. This amplified product may then be sequenced and used to 
generate the full length gene. 

This above method starts with total RNA isolated from the desired source, 
although poly- A+ RNA can be used. The RNA preparation can then be treated with a 
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phosphatase, if necessary, to eliminate 5* phosphate groups on degraded or damaged 
RNA which may interfere with the later RNA ligase step. The phosphatase should then 
be inactivated and the RNA treated with tobacco acid pyrophosphatase in order to 
remove the cap structure present at the 5' ends of messenger RNA. This reaction leaves 
a 5' phosphate group at the 5' end of the cap cleaved RNA which can then be ligatcd to 
an RNA oligonucleotide using T4 RNA ligase. 

This modified RNA preparation is used as a template for first strand cDNA 
synthesis using a gene specific oligonucleotide. The first strand synthesis reaction is 
used as a template for PCR amplification of the desired 5' end using a primer specific to 
the ligated RNA oligonucleotide and a primer specific to the known sequence of the 
gene of interest. The resultant product is then sequenced and analyzed to confirm thai 
the 5' end sequence belongs to the IL-21 gene. 

Example 2: Isolation of IL-21 Genomic Clones 

A human genomic PI library (Genomic Systems, Inc.) is screened by PCR 
using primers selected for the cDNA sequence corresponding to SEQ ID NO:l 
according to the method described in Example 1 (see also, Sambrook, et al, supra). 

Example 3: Tissue Distribution of IL-21 

Tissue distribution of mRNA expression of IL-21 is determined using protocols 
for Northern blot analysis, described by, among others, Sambrook and colleagues 
(supra). For example, an IL-21 probe produced by the method described in Example 1 
is labeled with 32 P using the rediprime™ DNA labeling system (Amersham Life 
Science), according to manufacturer's instructions. After labeling, the probe is purified 
using a CHROMA SPIN- 100™ column (C)ontech Laboratories, Inc.), according to 
manufacturer's protocol number PT1 200-1. The purified labeled probe is then used to 
examine various human tissues for mRNA expression. 

Multiple Tissue Northern (MTN) blots containing various human tissues (H) or 
human immune system (IM) tissues (Clontech) are examined with the labeled probe 
using ExpressHyb™ hybridization solution (Clontech) according to manufacturer's 
protocol number PT1 190-L Following hybridization and washing, the blots are 
mounted and exposed to film at -70*C overnight, and the films developed according to 
standard procedures. 

Using essentially the above-prescribed protocol, Northern blot analyses were 
performed to determine the expression pattern of IL-21 and IL-22. In the case of 
IL-21, a major message of approximately 5 kb was detected predominantly in thymus, 
but was also detectable in adrenal cortex, spleen, pancreas, and very weakly in lymph 
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node, PBL, fetal liver, adrenal medulla, thyroid, small intestine, stomach, and heart. In 
the case of IL-22, a major message of slightly less than 1 kb was detected in 
conjunction with a minor band of approximately 5 kb predominantly in testis and spinal 
cord, but was also detected in bone marrow and small intestine. 

Example 4: Chromosomal Mapping of IL-21 

An oligonucleotide primer set is designed according to the sequence at the 5' 
end of SEQ ID NO: I. This primer preferably spans about 100 nucleotides. This 
primer set is then used in a polymerase chain reaction under the following set of 
conditions : 30 seconds, 95*C; 1 minute, 56*C; 1 minute, 70 B C. This cycle is repeated 
32 times followed by one 5 minute cycle at 70"C. Human, mouse, and hamster DNA is 
used as template in addition to a somatic cell hybrid panel containing individual 
chromosomes or chromosome fragments (Bios, Inc). The reactions are analyzed on 
either 8% poly aery lamide gels or 3.5 % agarose gels. Chromosome mapping is 
determined by the presence of an approximately 100 bp PCR fragment in the particular 
somatic cell hybrid. 

flxarnpfc g; Bacterial ISxpressjon of IL-21 

An IL-21 polynucleotide encoding an IL-21 polypeptide of the invention is 
amplified using PCR oligonucleotide primers corresponding to the 5' and 3' ends of the 
DNA sequence, as outlined in Example 1, to synthesize insertion fragments. The 
primers used to amplify the cDNA insert should preferably contain restriction sites, 
such as Bam HI and Hin dm, at the 5* end of the primers in order to clone the 
amplified product into the expression vector. For example, Bam HT and Hin dill 
correspond to ihe restriction enzyme sites on the bacterial expression vector pQE-9 
(Qiagen Inc., Chatsworth, CA). This plasmtd vector encodes antibiotic resistance 
(Amp R ), a bacterial origin of replication (ori), an IPTG-rcgulatable promoter/operator 
(P/O), a ribosome binding site (RBS), a 6-histidine tag (6-His), and restriction enzyme 
cloning sites. 

Specifically, to clone the mature domain of the IL-21 protein in a bacterial 
vector, the 5' primer has the sequence 5'-GAT CGC GGA TCC G AC ACG GAT GAG 
GAC CGC TAT CCA CAG AAG CTG-3' (SEQ ID NO:9) containing the underlined 
Bam HI restriction site followed several nucleotides of the amino terminal coding 
sequence of the mature DL-21 sequence in SEQ ID NO:l. One of ordinary skill in the 
art would appreciate, of course, that the point in the protein coding sequence where the 
5' primer begins may be varied to amplify a DNA segment encoding any desired 
portion of the complete IL-21 protein shorter or longer than the mature form of the 
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protein. The 3' primer has the sequence 5*-CCC AAGCTT TCA CAC TGA ACG 
GGG CAG CAC GCA GGT GCA GC-3* (SEQ ID NO: 10) containing the underlined 
Hin dlTI restriction site followed by a number nucleotides complementary to the 3' end 
of the coding sequence of the IL-2 1 DNA sequence of SEQ ID NO: 1 . 

The pQE-9 vector is digested with Bam HI and Hin dTTI and the amplified 
fragment is ligated into the pQE-9 vector maintaining the reading frame initiated at the 
bacterial RBS. The ligation mixture is then used to transform the E. coli strain 
M15/rep4 (Qiagcn, Inc.) which contains multiple copies of the plasmid pREP4, which 
expresses the lad repressor and also confers kanamycin resistance (Kan k ). 
Transformants are identified by their ability to grow on LB plates and colonies are 
selected which are resistant to both ampicillin and kanamycin. Plasmid DNA is isolated 
and confirmed by restriction analysis. 

Clones containing the desired constructs are grown overnight (O/N) in liquid 
culture in LB media supplemented with both Amp (100 ug/ml) and Kan (25 ug/ml). 
The O/N culture is used to inoculate a large culture at a ratio of 1:100 to 1 :250. The 
cells are grown to an optical density 600 (O.D >600 ) of between 0.4 and 0.6. IPTG 
(Isopropyl-B-D-thiogalacto pyranoside) is then added to a final concentration of 1 mM. 
IPTG induces by inactivating the lacl repressor, clearing the promoter/operator leading 
to increased gene expression. 

Cells are grown for an additional 3 to 4 hours. Cells are then harvested by 
centrifugation (20 mins at 6000 X g). The cell pellet is solnbilized in the chaotropic 
agent 6 M Guanidine-HCl by stirring for 3-4 hours at 4'C. The cell debris is removed 
by centrifugation, and the supernatant containing the polypeptide is loaded onto a 
nickel-nttrilo-tri-acetic acid ("Ni-NTA") affinity resin column (QIAGEN, Inc., supra). 
Proteins with a 6 x His tag bind to the Ni-NTA resin with high affinity and can be 
purified in a simple one-step procedure (for details see: The QIAexpressionist (1 995) 
QIAGEN, Inc., supra). 

Briefly, the supernatant is loaded onto the column in 6 M guanidine-HCl, pH 8, 
the column is first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then washed 
with 10 volumes of 6 M guanidine-HCl pH 6, and finally the polypeptide is eluted with 
6 M guanidine-HCl, pH 5. 

The purified IL-21 protein is then renatured by dialyzing it against 
phosphate-buffered saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM 
NaCl. Alternatively, the IL-21 protein can be success fully refolded while immobilized 
on the Ni-NTA column. The recommended conditions are as follows: renature using a 
linear 6M-1M urea gradient in 500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH 7.4, 
containing protease inhibitors. The rcnaturation should he performed over a period of 
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1 .5 hours or more. After renaturation the proteins are eluted by the addition of 250 mM 
immidazole. Immidazole is removed by a final dialyzing step against PBS or 50 mM 
sodium acetate pH 6 buffer plus 200 mM NaCl. The purified TL-21 protein is stored at 
4°C or frozen at -80°C. 

In addition to the above expression vector, the present invention further includes 
an expression vector comprising phage operator and promoter elements operatively 
linked to an TL-21 polynucleotide, called pHE4a (ATCC Accession Number 209645, 
deposited February 25, 1998). This vector contains: (1) a neomycin 
phosphotransferase gene as a selection marker, (2) an E. coli origin of replication, (3) a 
T5 phage promoter sequence, (4) two lac operator sequences, (5) a Shine-Delgarno 
sequence, and (6) the lactose operon repressor gene (laclq). The origin of replication 
(oriC) is derived from pUC19 (LTl, Gaithersburg, MD). The promoter sequence and 
operator sequences are made synthetically. 

DN A can be inserted into the pHEa by restricting the vector with Nde I and Xba 
1, Bam HI, Xho I, or Asp 718, running the restricted product on a gel, and isolating the 
larger fragment (the stuffer fragment should be about 310 base pairs). The DNA insert 
is generated according to the PCR protocol described in Example 1 , using PCR primers 
which encode restriction sites for Nde I (5' primer) and Nde I and Xba I, Bam HI, Xho 
1, or Asp 718 (3* primer). The PCR insert is gel purified and restricted with compatible 
enzymes. The insert and vector are ligated according to standard protocols. 

The engineered vector could easily be substituted in the above protocol to 
express protein in a bacterial system. 

Example 6: Purification of IL-21 Polypeptide from an Inclusion Body 

The following alternative method can be used to purify IL-21 polypeptide 
expressed in E coli when it is present in the form of inclusion bodies. Unless otherwise 
specified, all of the following steps are conducted at 4-10 6 C. 

Upon completion of the production phase of the E. coli fermentation, the cell 
culture is cooled to 4-10'C and the cells harvested by continuous centrifugation at 
15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per unit 
weight of cell paste and the amount of purified protein required, an appropriate amount 
of cell paste, by weight, is suspended in a buffer solution containing 100 mM Tris, 50 
mM EDTA, pH 7.4. The cells arc dispersed to a homogeneous suspension using a 
high shear mixer. 

The cells are then lysed by passing the solution through a microfluidizer 
(Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is 
then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by 
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centrifiigation at 7000 xg for 15 min. The resultant pellet is washed again using 0.5M 
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4. 

The resulting washed inclusion bodies are solubilized with 1.5 M guanidine 
hydrochloride (GuHCI) for 2-4 hours. After 7000 x g centrifugation for 1 5 min., the 
5 pellet is discarded and the polypeptide containing supernatant is incubated at 4*C 
overnight to allow further GuHCI extraction. 

Following high speed centrifugation (30,000 x g) to remove insoluble particles, 
the GuHCI solubilized protein is refolded by quickly mixing the GuHCI extract with 20 
volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by 
10 vigorous stirring. The refolded diluted protein solution is kept at 4°C without mixing 
for 12 hours prior to further purification steps. 

To clarity the refolded polypeptide solution, a previously prepared tangential 
filtration unit equipped with 0.16 um membrane filter with appropriate surface area . 



□ (e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The 

OS 15 filtered sample is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive 

yj Biosystems). The column is washed with 40 mM sodium acetate, pH 6.0 and eluted 

£ with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same buffer, in a 

P stepwise manner. The absorbance at 280 nm of the effluent is continuously monitored. 

Q Fractions are collected and further analyzed by SDS-PAGE. 

jjj 20 Fractions containing the IL-21 polypeptide are then pooled and mixed with 4 

volumes of water. The diluted sample is then loaded onto a previously prepared set of 

\Q tandem columns of strong anion (Poros HQ-50, Perseptive Biosy stems) and weak 



anion (Poros CM-20, Perseptive Biosy stems) exchange resins. The columns are 
equilibrated with 40 mM sodium acetate, pH 6.0. Both columns are washed with 40 

25 mM sodium acetate, pH 6.0, 200 mM NaCl. The CM-20 column is then eluted using a 
10 column volume linear gradient ranging from 0.2 M NaCl, 50 mM sodium acetate, 
pH 6.0 to 1.0 M NaCl, 50 mM sodium acetate, pH 6.5. Fractions are collected under 
constant A^,, monitoring of the effluent. Fractions containing the polypeptide 
(determined, for instance, by 16% SDS-PAGE) are then pooled. 

30 The resultant IL-21 polypeptide should exhibit greater than 95% purity after the 

above refolding and purification steps. No major contaminant bands should be 
observed from Commassie blue stained 16% SDS-PAGE gel when 5 ug of purified 
protein is loaded. The purified IL-21 protein can also be tested for endotoxin/LPS 
contamination, and typically the LPS content is less than 0.1 ng/ml according to LAL 

35 assays. 
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Example 7: Cloning and Expression of IL-21 in a Baculovirus 
Expression System 

In this example, the plasmid shuttle vector pA2 is used to insert IL-21 
polynucleotide into a baculovirus to express DL-21 . This expression vector contains the 
strong polyhedrin promoter of the Autographa califomica nuclear polyhedrosis virus 
(AcMNPV) followed by convenient restriction sites such as Bam HI, Xba I and Asp 
718. The polyadenylation site of the simian virus 40 ("SV40") is used for efficient 
polyadenylation. For easy selection of recombinant virus, the plasmid contains the 
beta-gal actosidase gene from E, coli under control of a weak Drosophila promoter in the 
same orientation, followed by the polyadenylation signal of the polyhedrin gene. The 
inserted genes are flanked on both sides by viral sequences for cell-mediated 
homologous recombination with wild-type viral DNA to generate a viable virus that 
express the cloned IL-21 polynucleotide. 

Many other baculovirus vectors can be used in place of the vector ahove, such 
as pAc373, pVL941 , and pAcIMl, as one skilled in the art would readily appreciate, as 
long as the construct provides appropriately located signals for transcription, 
translation, secretion and the like, including a signal peptide and an in- frame AUG as 
required. Such vectors are described, for instance, by Luckow and colleagues 
(Virology 170:31-39 (1989)). 

Specifically, the IL-21 cDNA sequence contained in the deposited clone, 
including the AUG initiation codon and any naturally associated leader sequence, is 
amplified using the PCR protocol described in Example 1. If the naturally occurring 
signal sequence is used to produce the secreted protein, the pA2 vector does not need a 
second signal peptide. However, since the predicted naturally occurring signal peptides 
of IL-21 and IL-22 are not known, the vector can be modified (now designated 
pA2GP) to include a baculovirus leader sequence, using the standard methods 
described by Summers and coworkers ("A Manual of Methods for Baculovirus Vectors 
and Insect Cell Culture Procedures," Texas Agricultural Experimental Station Bulletin 
No. 1555 (1987)). 

More specifically, the cDNA sequence encoding the full-length TL-21 protein in 
the deposited clone is amplified using PCR oligonucleotide primers corresponding to 
the 5' and 3' sequences of the gene. The 5' primer has the sequence 5'-CGC CGC 
GGA TCC GCCATC CGC ACG AGT GGA CAC GG-3* (SEQ TD NO: 11) containing 
the Bam HI restriction enzyme site, an efficient signal for initiation of translation in 
eukaryotic cells (shown in the primer sequence in italics; Kozak, M., J. Mol Biol. 
196:947-950 (1987)), a "C" residue to preserve the reading frame, and 16 nucleotides 
of the sequence of the complete IL-21 protein shown in Figure 1. The 3' primer has the 
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sequence 5-CGC GOT ACC CAC TGA ACG GGG CAG CAC GC-3' (SRQ ID 
NO: 12) containing the Asp 718 restriction site followed by 20 nucleotides 
complementary to the 3* noncoding sequence in Figure 1 . 

The amplified fragment is isolated from a 1% agarose gel using a commercially 
available kit ("Geneclean," BTO 101 Inc., La Jolla, CA). The fragment then is digested 
with appropriate restriction enzymes and again purified on a 1% agarose gel. 

The plasmid is digested with the corresponding restriction enzymes and 
optionally, can be dephosphorylated using calf intestinal phosphatase, using routine 
procedures known in the art. The DNA is then isolated from a 1% agarose gel using a 
commercially available kit ("Geneclean" BIO 101 Inc., La Jolla, CA). 

The fragment and the dephosphorylated plasmid are iigated together with T4 
DNA ligase. E. coli HB101 or other suitable E. colt hosts such as XL-1 Blue 
(Stratagene Cloning Systems, La Jolla, CA) cells are transformed with the ligation 
mixture and spread on culture plates. Bacteria containing the plasmid are identified by 
digesting DNA from individual colonies and analyzing the. digestion product by gel 
electrophoresis. The sequence of the cloned fragment is confirmed by DNA 
sequencing. 

Five ug of a plasmid containing the polynucleotide is co-trans fected with 1.0 u.g 
of a commercially available linearized baculo virus DNA ("BaculoGold™ bacuiovirus 
DNA", Pharmingen, San Diego, CA), using the lipofeclion method described by 
Feigner and colleagues (Proc. Natl. Acad. ScL USA 84:7413-7417 (1987)). One ug 
of BaculoGold™ virus DNA and 5 ug of the plasmid are mixed in a sterile well of a 
microtiter plate containing 50 ul of serum-free Grace's medium (Life Technologies Inc., 
GaiLhersburg, MD). Afterwards, 10 ul Lipofectin plus 90 ul Grace's medium are 
added, mixed and incubated for 15 minutes at room temperature. Then the transfection 
mixture is added drop-wise to Sf9 insect cells (ATCC CRL 1711) seeded in a 35 mm 
tissue culture plate with 1 ml Grace's medium without serum. The plate is then 
incubated for 5 hours at 27°C. The transfection solution is then removed from the plate 
and 1 ml of Grace's insect medium supplemented with 10% fetal calf serum is added. 
Cultivation is then continued at 27°C for four days. 

After four days the supernatant is collected and a plaque assay is performed, as 
described by Summers and Smith (supra). An agarose gel with "Blue Gal" (Life 
Technologies Inc., Gaithersburg) is used to allow easy identification and isolation of 
gal-expressing clones, which produce blue-stained plaques. (A detailed description of a 
"plaque assay" of this type can also be found in the user's guide for insect cell culture 
and baculo virology distributed by Life Technologies Inc., Gaithersburg, page 9-10.) 
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After appropriate incubation, blue stained plaques are picked with the tip of a 
micropipettor (e.g., Eppendorf). The agar containing the recombinant viruses is then 
resuspended in a microcentrifuge tube containing 200 ul of Grace's medium and the 
suspension containing the recombinant baculovirus is used to infect Sf9 cells seeded in 
5 35 mm dishes. Four days later the supematants of these culture dishes are harvested 
and then they are stored at 4°C 

To verify the expression of the polypeptide, Sf9 cells are grown in Grace*s 
medium supplemented with 10% heat-inacuvated FBS. The cells are infected with the 
recombinant baculovirus containing the polynucleotide at a multiplicity of infection 

10 ("MOI") of about 2. If radiolabeled proteins are desired, 6 hours later the medium is 
removed and is replaced with SF900 II medium minus methionine and cysteine 
(available from Life Technologies Inc., Rockville, MD). After 42 hours, 5 u.Ci of 
35 S-methionine and 5 u,Ci -cysteine (available from Amersham) are added. The cells 
are further incubated for 16 hours and then are harvested by centrifugation. The 

15 proteins in the supernatant as well as the intracellular proteins are analyzed by 
SDS-PAGE followed by autoradiography (if radiolabeled). 

Microsequencing of the amino acid sequence of the amino terminus of purified 
protein may be used to determine the amino terminal sequence of the produced 1L-21 
protein. 

20 

Example 8; Ex pression of KL-21 In Mammalian Cells 

BL-21 polypeptide can be expressed in a mammalian cell. A typical mammalian 
expression vector contains a promoter element, which mediates the initiation of 
transcription of mRNA, a protein coding sequence, and signals required for the 

25 termination of transcription and polyadenylation of the transcript. Additional elements 
include enhancers, Kozak sequences and intervening sequences flanked by donor and 
acceptor sites for RNA splicing. Highly efficient transcription is achieved with the 
early and late promoters from SV40, the long terminal repeats (LTRs) from 
Retroviruses, e.g., RSV, HTLV-I, HIV-1 and the early promoter of the 

30 cytomegalovirus (CMV). However, cellular elements can also be used (e.g., the 
human actin promoter). 

Suitable expression vectors for use in practicing the present invention include, 
for example, vectors such as pS VL and pMSG (Pharmacia, Uppsala, Sweden), 
. pRSVcat (ATCC 37152), pSV2dhfr (ATCC 37146), pBC12MI (ATCC 67109), 

35 pCMVSport 2.0, and pCMVSport 3.0. Mammalian host cells that could be used 

include, Hela, 293, H9 and Jurkat cells, mouse NIH3T3 and C127 cells, Cos 1 , Cos 7 
and CV1, quail QC1-3 cells, mouse L cells and Chinese hamster ovary (CHO) cells. 
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Alternatively, IL-21 polypeptide can be expressed in stable cell lines containing 
the IL-21 polynucleotide integrated into a chromosome. The co-lransfection with a 
selectable marker such as dhfr, gpt, neomycin or hygromycin allows the identification 
and isolation of the iransfected cells. 

The transfected IL-21 gene can also be amplified to express large amounts of the 
encoded protein. The DHFR (dihydrofolate reductase) marker is useful in developing 
cell lines that carry several hundred or even several thousand copies of the gene of 
interes (see, e.g., Alt, F. W., et al, J. Biol. Chem. 253:1357-1370 (1978); Hamlin, 
J. L. and Ma, C, Biochem. et Biophys. Acta, 1097:107-143 (1990); Page, M. J. and 
Sydenham, M. A., Biotechnology 9:64-68 (1991)). Another useful selection marker is 
the enzyme glutamine synthase (GS; Murphy, et al. f Biochem. J. 227:277-279 (1991); 
Bebbington, etal, Bio/Technology 10:169-175 (1992)). Using these markers, the 
mammalian cells are grown in selective medium and the cells with the highest resistance 
are selected. These cell lines contain the amplified gene(s) integrated into a 
chromosome. Chinese hamster ovary (CHO) and NSO cells are often used for the 
production of proteins. 

Derivatives of the plasmid pSV2-dhfr (ATCC Accession No. 37146), the 
expression vectors pC4 (ATCC Accession No. 209646) and pC6 (ATCC Accession 
No.209647) contain the strong promoter (LTR) of the Rous Sarcoma Virus (Cullen, et 
al, Mol Cell Biol, 438-447 (March, 1985)) plus a fragment of the CMV-enhancer 
(Boshart, et al, Cell 41:521-530 (1985)). Multiple cloning sites, e.g., with the 
restriction enzyme cleavage sites Bam HI, Xba I and Asp 718, facilitate the cloning of 
IL-21. The vectors also contain the 3' intron, the polyadenylation and termination 
signal of the rat preproinsulin gene, and the mouse DHFR gene under control of the 
SV40 early promoter. 

Specifically, the plasmid pC6, for example, is digested with appropriate 
restriction enzymes and then dephosphorylated using calf intestinal phosphates by 
procedures known in the art. The. vector is then isolated from a 1% agarose gel. 

IL-21 polynucleotide is amplified according to the protocol outlined in Example 
1. If the naturally occurring signal sequence is used to produce the secreted protein, the 
vector does not need a second signal peptide. Alternatively, if the naturally occurring 
signal sequence is not used, the vector can he modified to include a heterologous signal 
sequence (see, e.g., WO 96/34891). 

The amplified fragment is isolated from a 1% agarose gel using a commercially 
available kit ("Geneclean," BIO 101 Inc., La JoIIa, Ca.). The fragment then is digested 
with appropriate restriction enzymes and again purified on a 1 % agarose gel. 
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The amplified fragment is then digested with the same restriction enzyme and 

purified on a 1% agarose gel. The isolated fragment and the dephosphorylated vector 

are then ligated with T4 DNA ligase. E. coli HB101 or XL-1 Blue cells are then 

transformed and bacteria are identified that contain the fragment inserted into plasmid 

5 pC6 using, for instance, restriction enzyme analysis. 

Chinese hamster ovary cells lacking an active DTTFR gene is used for 

transfection. Five ug of the expression plasmid pC6 is cotransfected with 0.5 ug of the 

plasmid pSVneo using lipofectin (Feigner, et al., supra). The plasmid pS V2-neo 

contains a dominant selectable marker, the neo gene from Tn5 encoding an enzyme that 

10 confers resistance to a group of antibiotics including G418. The cells are seeded in 

alpha minus MEM supplemented with 1 mg/ml G418. After 2 days, the cells are 

trypsin ized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha minus 

q MEM supplemented with 10, 25, or 50 ng/mi of metothrexate plus 1 mg/ml G418. 

□ After about 10-14 days single clones are trypsinized and then seeded in 6-well petri 

5j 15 dishes or 10 ml flasks using different concentrations of methotrexate (for example, 50 

UJ nM, 100 nM, 200 nM, 400 nM, 800 nM). Clones growing at the highest 

jjjj concentrations of methotrexate are then transferred to new 6-well plates containing even 

B higher concentrations of methotrexate (1 uM, 2 uM, 5 uM, 10 mM, 20 niM). The 

2 same procedure is repeated until clones are obtained which grow at a concentration of 

Hj 20 100-200 uM. Expression of TL-21 is analyzed, for instance, by SDS-PAGE and 

$ Western blot or by reverse phase IIPLC analysis. 

vO 

£8 

Example 9: Protein Fusions of IL-21 

TL-21 polypeptides are preferably fused to other proteins. These fusion proteins 

25 can be used for a variety of applications. For example, fusion of IL-21 polypeptides to 
His-lag, HA-tag, protein A, IgG domains, and maltose binding protein facilitates 
purification (see Example 5; see also EP A 394,827; Traunecker, et al, Nature 
331:84-86 (1988).) Similarly, fusion to IgG-l, lgG-3, and albumin increases the 
halflife lime in vivo. Nuclear localization signals fused to IL-21 polypeptides can target 

30 the protein to a specific subcellular localization, while covalent heterodimer or 

homodimers can increase or decrease the activity of a fusion protein. Fusion proteins 
can also create chimeric molecules having more than one function. Finally, fusion 
proteins can increase solubility and/or stability of the fused protein compared to the 
non-fused protein. All of the types of fusion proteins described above can be made by 

35 modifying the following protocol, which oudines the fusion of a polypeptide to an IgG 
molecule, or the protocol described in Example 5. 
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Briefly, the human Fc portion of the IgG molecule can be PCR amplified, using 
primers that span the 5' and 3* ends of the sequence described below. These primers 
also should have convenient restriction enzyme sites that will facilitate cloning into an 
expression vector, preferably a mammalian expression vector. 

For example, if pC4 (Accession No. 209646) is used, the human Fc portion can 
be ligated into the Bam HI cloning site. Note that the 3' Bam HI site should be 
destroyed. Next, the vector containing the human Fc portion is again restricted with 
Bam HI, linearizing the vector, and IL-21 polynucleotide, isolated by the PCR protocol 
described in Example 1, is ligated into this Bam HI site. Note that the polynucleotide is 
cloned without a stop codon, otherwise a fusion protein will not be produced. 

If the naturally occurring signal sequence is used to produce the secreted 
protein, pC4 does not need a second signal peptide. Alternatively, if the naturally 
occurring signal sequence is not used, the vector can be modified to include a 
heterologous signal sequence (see, e.g., WO 96/34891). 

Human IgG Fc region (SEQ ID NO:13): 

GGGATCCGGAGCCC^TCITCIK^ 

TTCCTCTTCXTCCCCAAAACCXIAAGGACACC^ 

CGAAGACCCTGAGGTCAAGTTCAAC^^ 

ACAACAGCACCTACCGTCTGGTC 

TCCAT^AAAGCCCTCCGAACCCCC^ 

GCCCCCATCCCGGGATGAGCTGA 

TGGAGTQGGAGAGCAATGGGCA^ 

TACAGCAAGCTCACCGTGGACAA^ 

CCACTACACGCAGAAGAGCCTCTCC^^ 

Example 10; Production of an Antibody 

The antibodies of the present invention can be prepared by a variety of methods 
(see, Current Protocols, Chapter 2). For example, cells expressing IL-21 is 
administered to an animal to induce the production of sera containing polyclonal 
antibodies. In a preferred method, a preparation of IL-21 protein is prepared and . 
purified to render it substantially free of natural contaminants. Such a preparation is 
then introduced into an animal in order to produce polyclonal antisera of greater specific 
activity. 

In the most preferred method, the antibodies of the present invention are 
monoclonal antibodies (or protein binding fragments thereof). Such monoclonal 
antibodies can be prepared using hybridoma technology (Kohler, et al, Nature 
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256:495 (1975); Kohler, ex at., Eur. J. Immunol. 6:51 1 (1976); Kohler, et at, Eur. J. 
Immunol. 6:292 (1976); Hammerling, ex ai, in: Monoclonal Antibodies and T-Cell 
Hybridomas, Elsevier, N.Y., pp. 563-681 (1981)). In general, such procedures 
involve immunizing an animal (preferably a mouse) with IL-21 polypeptide or, more 
preferably, with a secreted IL-21 polypeplide-expressing cell. Such cells may be 
cultured in any suitable tissue culture medium; however, it is preferable to culture cells 
in Earle's modified Eagle's medium supplemented with 1 0% fetal bovine serum 
(inactivated at about 56" C), and supplemented with about 10 g/1 of nonessential amino 
acids, about 1,000 U/ml of penicillin, and about 100 ug/ml of streptomycin. 

The splenocytes of such mice are extracted and fused with a suitable myeloma 
cell line. Any suitable myeloma cell line may be employed in accordance with the 
present invention; however, it is preferable to employ the parent myeloma cell line 
(SP20), available from the ATCC. After fusion, the resulting hybridoma cells are 
selectively maintained in HAT medium, and then cloned by limiting dilution as 
described by Wands and colleagues (Gastroenxerology 80:225-232 (1981 )). The 
hybridoma cells obtained through such a selection are then assayed to identify clones 
which secrete antibodies capable of binding the IL-21 polypeptide. 

Alternatively, additional antibodies capable of binding to IL-21 polypeptide can 
be produced in a two-step procedure using anti-idiotypic antibodies. Such a method 
makes use of the fact that antibodies are themselves antigens, and therefore, it is 
possible to obtain an antibody which binds to a second antibody. In accordance with 
this method, protein specific antibodies are used to immunize an animal, preferably a 
mouse. The splenocytes of such an animal are then used to produce hybridoma cells, 
and the hybridoma cells are screened to identify clones which produce an antibody 
whose ability to biud to the IL-21 protein-specific antibody can be blocked by EL-21 . 
Such antibodies comprise anti-idiotypic antibodies to the IL-21 protein-specific 
antibody and can be used to immunize an animal to induce formation of further IL-21 
protein -specific antibodies. 

It will be appreciated that Fab and F(ab')2 and other fragments of the antibodies 
of the present invention may be used according to the methods disclosed herein. Such 
fragments are typically produced by proteolytic cleavage, using enzymes such as papain 
(to produce Fab fragments) or pepsin (to produce F(ab')2 fragments). Alternatively, 
secreted IL-21 protein-binding fragments can be produced through the application of 
recombinant DNA technology or through synthetic chemistry. 

For in vivo use of antibodies in humans, it may be preferable to use 
"humanized" chimeric monoclonal antibodies. Such antibodies can be produced using 
genetic constructs derived from hybridoma cells producing the monoclonal antibodies 
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described above. Methods for producing chimeric antibodies are known in the art (see, 
for review, Morrison, Science 229:1202 (1985); Oi, et al, BioTechniques 4:214 
(1986); Cabilly, et al, U.S. Patent No. 4,816,567; Tanigucht, etal, EP 171496; 
Morrison, etal, EP 173494; Neuberger, etal, WO 8601533; Robinson, et al t WO 
8702671; Boulianne, etal, Nature 312:643 (1984); Neuberger, etal. Nature 
314:268 (1985)). 

Example 11: Production Of IL-21 Protein For High-Throughput 
Screening Assays 

The following protocol produces a supernatant containing IL-21 polypeptide to 
be tested. This supernatant can then be used in the screening assays described 
subsequently in Examples 13-20. 

First, dilute poIy-D-lysine (644 587 Boehringer-Mannheim) stock solution 
(lmg/ml in PBS) 1:20 in PBS (Phosphate Buffered Saline; w/o calcium or magnesium 
17-516F Biowhittaker) for a working solution of 50 ng/ml Add 200 ul of this solution 
to each well (24 well plates) and incubate at RT for 20 minutes. Be sure to distribute 
the solution over each well (note: a 12-channel pipetter may be used with tips on every 
other channel). Aspirate the poly-D-lysine solution and rinse with 1 ml PBS. The PBS 
should remain in the well until just prior to plating the cells and plates may be 
poly-lysine coated in advance for up to two weeks. 

Plate 293T cells (do not carry cells past P+20) at 2 x 10 5 cells/well in 0.5 ml 
DMEM (Dulbecco's Modified Eagle Medium) supplemented with 4.5 G/L glucose, 
L-glutamine (12-604F Biowhittaker)), 10% heat inactivated FBS (14-503F 
Biowhittaker), and Ix Penstrep (17-602E Biowhittaker). Let the cells grow overnight. 

Following overnight incubation, mix together in a sterile solution basin: 300 ul 
Lipofectamine (18324-012 Gibco/BRL) and 5ml Optimem I (31985070 Gibco/BRL) in 
each well of a 96-welI plate. With a small volume multi-channel pipetter, aliquot 
approximately 2 ug of an expression vector containing a polynucleotide insert, 
produced by the methods described in Examples 8 or 9, into an appropriately labeled 
96- well round bottom plate. With a multi-channel pipetter, add 50 ul of the 
Lipofectamine/Optimem 1 mixture to each well. Pipette up and down gently to mix. 
Incubate at RT for 15-45 minutes. After about 20 minutes, use a multi-channel pipetter 
to add 1 50 ul Optimem I to each well. As a control, one plate of vector DNA lacking an 
insert should be transfected with each set of transfections. 

Preferably, the transfection should be performed by simultaneously performing 
the following tasks in a staggered fashion. Thus, hands-on time is cut in half, and the 
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cells are not excessively incubated in PBS. First, person A aspirates the media from 
four 24-well plates of cells, and then person B rinses each well with 0.5-lml PBS. 
Person A then aspirates the PBS rinse, and person B, using a 12-channel pipetter with 
tips on every other channel, adds the 200 ul of DNA/Lipofectamine/Optimem J complex 
5 to the odd wells first, then to the even wells, to each row on the 24-well plates. Plates 
are then incubated at 37°C for 6 hours. 

While cells are incubating, the appropriate media is prepared: either 1% BSA in 
DMEM with lx penstrep, or HGS CHO-5 media (116.6 mg/L of CaCl 2 (anhyd); 
0.00130 mg/L CuS0 4 -5H 2 0; 0.050 mg/L of Fe(N0 3 ) 3 -9H 2 0; 0.417 mg/L of 
10 FeS0 4 -7H 2 0; 3 1 1 .80 mg/L of KC1; 28,64 mg/L of MgCl 2 ; 48.84 mg/L of MgS0 4 ; 
6995.50 mg/L of Nad; 2400.0 mg/L of NaHC0 3 ; 62.50 mg/L of NaH 2 PO 4 -H 2 0; 
71.02 mg/L of Na 2 HP0 4 ; .4320 mg/L of ZnS0 4 -7H 2 0; .002 mg/L of Arachidonic 
g Acid; 1 .022 mg/L of Cholesterol; 0.070 mg/L of D-L-alpha-Tocopherol-Acetate; 

□ 0.0520 mg/L of Linoleic Acid; 0.010 mg/L of Linolenic Acid; 0.010 mg/L of Myristic 
ffl 1 5 Acid; 0.010 mg/L of Oleic Acid; 0.0 1 0 mg/L of Palmitric Acid; 0.0 1 0 mg/L of Palmitic 
y Acid; 100 mg/L of Pluronic F-68; 0.010 mg/L of Stearic Acid; 2.20 mg/L of Tween 80; 
£ 4551 mg/L of D-Glucose; 130.85 mg/ml of L-Alanine; 1 47.50 mg/ml of 

3 L-Arginine-HCL; 7.50 mg/ml of L-Asparagine-H 2 0; 6.65 mg/ml of L-Aspartic Acid; 

□ 29.56 mg/ml of L-Cystine-2HCL-H 2 0; 3 1.29 mg/ml of L-Cystine-2HC1; 7.35 mg/m! of 
W 20 L-Glutamic Acid; 365.0 mg/ml of L-Glutamine; 1 8.75 mg/ml of Glycine; 52.48 mg/ml 
[g of L-Histidine-HCL-H 2 0; 1 06.97 mg/ml of L-Isolcucine; 1 1 1 .45 mg/ml of L-Leucine; 
*B 163.75 mg/ml of L-Lysine HCL; 32.34 mg/ml of L-Methionine; 68.48 mg/ml of 

^ L-Phenylalainine; 40.0 mg/ml of L-Proline; 26.25 mg/ml of L-Serine; 101.05 mg/ml of 

L-Threonine; 19.22 mg/ml of L-Tryptophan; 91.79 mg/ml of L-Tryrosine-2Na-2H 2 0; 

25 and 99.65 mg/ml of L- Valine; 0.0035 mg/L of Biolin; 3.24 mg/L of D-Ca Pantothenate; 
1 1.78 mg/L of Choline Chloride; 4.65 mg/L of Folic Acid; 15.60 mg/L of i-Tnositol; 
3.02 mg/L of Niacinamide; 3.00 mg/L of Pyridoxal HCL; 0.031 mg/L of Pyridoxine 
HCL; 0.319 mg/L of Riboflavin; 3.17 mg/L of Thiamine HCL; 0.365 mg/L of 
Thymidine; 0.680 mg/L of Vitamin B, 2 ; 25 mM of HEPES Buffer; 2.39 mg/L of Na 

30 Hypoxanthine; 0.105 mg/L of Lipoic Acid; 0.081 mg/L of Sodium Putrescine-2HCL; 
55.0 mg/L of Sodium Pyruvate; 0.0067 mg/L of Sodium Selenite; 20uM of 
Ethanolamine; 0.122 mg/L of Ferric Citrate; 41.70 mg/L of Melhyl-B-Cyclodextrin 
complexed with Linoleic Acid; 33.33 mg/L of Methyl-B-Cyclodextrin complexed with 
Oleic Acid; 10 mg/L of Methyl-B-Cyclodextrin complexed with Retinal Acetate. Adjust 

35 osmolarity to 327 mOsm) with 2mm glutamine and 1 x penstrep. (BSA (81-068-3 

Bayer) lOOgm dissolved in 1L DMEM for a 10% BSA stock solution). Filter the media 
and collect 50 ul for endotoxin assay in 15ml polystyrene conical. 
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The transfection reaction is terminated, again, preferably by two people, at the 
end of the incubation period. Person A aspirates the transfection media, while person B 
adds 1.5 ml of the appropriate media to each well. Incubate at 37°C for 45 or 72 hours, 
depending on the media used (1%BSA for 45 hours or CHO-5 for 72 hours). 

On day four, using a 300 ul multichannel pipetter, aliquot 600 ul in one 1ml 
deep well plate and the remaining supernatant into a 2 ml deep well. The supernatants 
from each well can then be used in the assays described in Examples 1 3-20. 

It is specifically understood that when activity is obtained in any of the assays 
described below using a supernatant, the activity originates from either the IL-21 
polypeptide directly (e.g., as a secreted protein) or by IL-21 inducing expression of 
other proteins, which are then secreted into the supernatant. Thus, the invention further 
provides a method of identifying the protein in the supernatant characterized by an 
activity in a particular assay. 

Example 12: Construction of GAS Reporter Construct 

One signal transduction pathway involved in the differentiation and proliferation 
of cells is called the Jaks-STATs pathway. Activated proteins in the Jaks-STATs 
pathway bind to gamma activation site ("GAS") elements or interferon-sensitive 
responsive element ("TSRE"), located in the promoter of many genes. The binding of a 
protein to these elements alter the expression of the associated gene. 

GAS and IS RE elements are recognized by a class of transcription factors called 
Signal Transducers and Activators of Transcription, or "STATs." There are six 
members of the STATs family. Statl and Stat3 are present in many cell types, as is 
Stat2 (as response to IFN-alpha is widespread). Stat4 is more restricted and is not in 
many cell types though it has been found in T-helper class I, cells after treatment with 
IL-12. Stat5 was originally called mammary growth factor, but has been found at 
higher concentrations in other cells including myeloid cells. It can be activated in tissue 
culture cells by many cytokines. 

The STATs are activated to translocate from the cytoplasm to the nucleus upon 
tyrosine phosphorylation by a set of kinases known as the Janus Kinase ("Jaks") 
family. Jaks represent a distinct family of soluble tyrosine kinases and include Tyk2, 
Jakl, Jak2 t and Jak3. These kinases display significant sequence similarity and are 
generally catalytically inactive in resting cells. 

The Jaks are activated by a wide range of receptors summarized in the Table 
below (adapted from review by Schidler and Darnell, Ann. Rev. Biochem. 64:621-51 
(1995))). A cytokine receptor family, capable of activating Jaks, is divided into two 
groups: (a) Class 1 includes receptors for IL-2, IL-3, TL-4, TL-6, IL-7, IL-9, IL-1 1. 
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1L-12, IL-15, Epo, PRL, GH, G-CSF, GM-CSF, LIF, CNTF ? and throxnbopoietin; 

and (b) Class 2 includes IFN-a, EFN-7, and lL-10. The Class 1 receptors share a 

conserved cysteine motif (a set of four conserved cysteines and one tryptophan) and a 

WSXWS motif (a membrane proximal region encoding Trp-Ser-Xxx-Trp-Ser (where 

5 "Xxx" represents any amino acid; SEQ ID NO: 14)). 

Thus, on binding of a ligand to a receptor, Jaks are activated, which in turn 

activate STATs, which then translocate and bind to GAS elements. This entire process 

is encompassed in the Jaks-STATs signal transduction pathway. 

Therefore, activation of the Jaks-STATs pathway, reflected by the binding of 

1 0 the GAS or the ISRE element, can be used to indicate proteins involved in the 

proliferation and differentiation of cells. For example, growth factors and cytokines are 

known to activate the Jaks-STATs pathway (see Table below). Thus, by using GAS 

p elements linked to reporter molecules, activators of the Jaks-STATs pathway can be 

P identified. 
05 15 

>j ISRE 
* Ligand 

□ 1FN family 

20 IFN-a/B 

□ IFN-g 
in (IRFl>Lys6>lFP) 

fU 11-10 
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Growth hormone family 

GH ? - + - 5 

PRL ? +/-+- 1.3,5 

EPO ? - + - 5 
G AS(B-C AS>I RF 1 =1 FP»Ly 6) 

Receptor Tyrosine Kinases 

EGF ? + + - 1,3 GAS(IRFl) 

PDGF ? + + - 1,3 

CSF-1 ? + + - 1,3 GAS (not IRFi) 



To construct a synthetic GAS containing promoter element, which is used in the 

15 biological assays described in Examples 13-14, a PGR based strategy is employed to 
generate a GAS-SV40 promoter sequence. The 5* primer contains four tandem copies 

P of the GAS-binding site found in the IRFI promoter and previously demonstrated to 

D bind STATs upon induction with a range of cytokines (Rothman, et ai t Immunity 

jjjjj 1:457-468 (1994)), although other GAS or 1SRE elements can be used instead. The 5* 

yj 20 primer also contains 1 8 bp of sequence complementary to the S V40 early promoter 

£ sequence and is flanked with an Xho I restriction site. The sequence of the 5' primer is: 

P 5-GCG CCT CGA GAT 1TC CCC GAA ATC TAG ATT TCC CCG AAA TGA TTT 

□ CCC CGA A AT GAT TTC CCC GAA ATA TCT GCC ATC TC A ATT AG-3' (SEQ 

j£j ID NO: 15). 

$ 25 The downstream primer is complementary to the S V40 promoter and is flanked 

£ with a Hin dill site: 5'-GCG GCA AGC TTT TTG CAA AGC CTA GGC-3' (SEQ TD 



NO: 16). 

PCR amplification is performed using the S V40 promoter template present in 
the B-gal:promoter plasmid obtained from Clontech. The resulting PCR fragment is 
30 digested with Xho I and Hin dill and subcloned into BLSK2- (Stratagene). 

Sequencing with forward and reverse primers confirms that the insert contains the 
following sequence: 

CTCGAGAITTCCCCGAAATCTAGATT^ 
CAGCAACCATAXHCCC^^ 
35 CTAATTTTTTTTATTTATCCAGA^ 

CCTAQGCTTTTGCAAAfiAQCII (SEQ ID NO: 17). 

With this GAS promoter element linked to the SV40 promoter, a G AS:SEAP2 
reporter construct is next engineered. Here, the reporter molecule is a secreted alkaline 
phosphatase, or "SEAP". Clearly, however, any reporter molecule can be instead of 
40 SEAP, in this or in any of the other Examples. Well known reporter molecules that can 
be used instead of SEAP include chloramphenicol acetyltransferase (CAT), luciferase, 
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alkaline phosphatase, B-galuctosidase, green fluorescent protein (GFP), or any protein 
detectable by an antibody. 

The above sequence confirmed synthetic GAS-SV40 promoter element is 
subcloned into the pSEAP-Promoter vector obtained from Clontech using Hin dill and 
Xho I, effectively replacing the SV40 promoter with the amplified GAS:SV40 promoter 
element, to create the GAS-SEAP vector. However, this vector does not contain a 
neomycin resistance gene, and therefore, is not preferred for mammalian expression 
systems. 

Thus, in order to generate mammalian stable cell lines expressing the 
GAS-SEAP reporter, the GAS-SEAP cassette is removed from the GAS-SEAP vector 
using Sal I and Not 1, and inserted into a backbone vector containing the neomycin 
resistance gene, such as pGFP-1 (Clontech), using these restriction sites in the multiple 
cloning site, to create the GAS-SEAP/Neo vector. Once this vector is transfected into 
marrimalian cells, this vector can then be used as a reporter molecule for GAS binding 
as described in Examples 13-14. 

Other constructs can be made using the above description and replacing GAS 
with a different promoter sequence. For example, construction of reporter molecules 
containing NF-kB and EGR promoter sequences are described in Examples 15 and 16. 
However, many other promoters can be substituted using the protocols described in 
these Examples. For instance, SRE, IL-2, NFAT, or Osteocalcin promoters can be 
substituted, alone or in combination (e.g., GAS/NF-KB/EGR, GAS/NF-KB, 
II-2/NFAT, or NF-KB/GAS). Similarly, other cell lines can be used to test reporter 
construct activity, such as HeLa (epithelial), HUVEC (endothelial), Reh (B-cell), 
Saos-2 (osteoblast), HUVAC (aortic), or Cardiomyocyte. 

Example 13: High-Throughput Screening Assay for T-ccIl Activity, 

The following protocol is used to assess T-cell activity of IL-2 1 by determining 
whether IL-21 supernatant proliferates and/or differentiates T-cells. T-cel! activity is 
assessed using the GAS/SEAP/Neo construct produced in Example 12. Thus, factors 
that increase SEAP activity indicate the ability to activate the Jaks-STATS signal 
transduction pathway. The T-cell used in this assay is Jurkat T-cells (ATCC Accession 
No. TIB-152), although Molt-3 cells (ATCC Accession No. CRL-1552) and Molt-4 
cells (ATCC Accession No. CRL-1582) cells can also be used. 

Jurkat T-cells are lymphoblastic CD4+ Thl helper cells. In order to generate 
stable cell lines, approximately 2 million Jurkat cells are transfected with the 
GAS-SEAP/neo vector using DMRIE-C (Life Technologies; transfection procedure 
described below). The transfected cells are seeded to a density of approximately 
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20,000 cells per well and transfectants resistant to 1 mg/ml genticin selected. Resistanl 
colonies are expanded and then tested for their response to increasing concentrations of 
interferon gamma. The dose response of a selected clone is demonstrated. 

Specifically, the following protocol will yield sufficient cells for 75 wells 
5 containing 200 ul of cells. Thus, it is either scaled up, or performed in multiple to 

generate sufficient cells for multiple 96 well plates. Jurkat cells are maintained in RPMI 
+ 10% serum with l%Pen-Strep. Combine 2.5 mis of OPTI-MEM (Life Technologies) 
with 10 ug of plasmid DNA in a T25 flask. Add 2.5 ml OPTI-MEM containing 50 ul 
of DMR1E-C and incubate at room temperature for 15-45 min. 
10 During the incubation period, count cell concentration, spin down the required 

number of cells (10 7 per transfection), and resuspend in OPTI-MEM to a final 
concentration of 10 7 cells/ml. Then add 1ml of 1 x 10 7 cells in OPTI-MEM to T25 flask 
p and incubate at 37°C for 6 hrs. After the incubation, add 10 ml of RPMI + 15% serum. 

B The Jurkat:GAS-SEAP stable reporter lines are maintained in RPMI + 10% 

fjj 15 serum, I mg/ml Genticin, and 1 % Pen-Strep, These cells are treated with supernatants 

(jy containing IL-21 polypeptides or 1L-21 induced polypeptides as produced by the 

£ protocol described in Example 1 1 . 

, On the day of treatment with the supernatant, the cells should be washed and 

O resuspended in fresh RPMI + 10% serum to a density of 500,000 ceils per ml. The 

jjj 20 exact number of cells required will depend on the number of supernatant s being 

& screened. For one 96 well plate, approximately 10 million cells (for 10 plates, 100 

*f million cells) are required. 

Transfer the cells to a triangular reservoir boat, in order to dispense the cells into 
a 96 well dish, using a 12 channel pipette. Using a 12 channel pipette, transfer 200 ul 
25 of cells into each well (therefore adding 100, 000 cells per well). 

After all the plates have been seeded, 50 ul of the supernatants are transferred 
directly from the 96 well plate containing the supernatants into each well using a 12 
channel pipette. In addition, a dose of exogenous interferon gamma (0.1 T 1.0, 10 ng) 
is added to wells H9, H10, and HI 1 to serve as additional positive controls for the 
30 assay. 

The 96 well dishes containing Jurkat cells treated with supernatants arc placed in 
an incubator for 48 hrs (note: this time is variable between 48-72 hrs). 35 ul samples 
from each well are then transferred to an opaque 96 well plate using a 12 channel 
pipette. The opaque plates should be covered (using sellophene covers) and stored at 
35 -20"C until SEAP assays are performed according to Example 17. The plates 
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containing the remaining treated ceJls are placed at 4°C and serve as a source of material 
for repeating the assay on a specific well if desired. 

As a positive control, 100 Unit/ml interferon gamma can be used which is 
known to activate Jurkat T cells. Over 30 fold induction is typically observed in the 
positive control wells. 

Example 14: High-Throughput Screening Assay Identifying Myeloid 
Activity 

The following protocol is used to assess myeloid activity of IL-21 by 
determining whether IL-21 proliferates and/or differentiates myeloid cells. Myeloid cell 
activity is assessed using the GAS/SEAP/Neo construct produced in Example 12. 
Thus, factors that increase SEAP activity indicate the ability to activate the Jaks-STATS 
signal transduction pathway. The myeloid cell used in this assay is U937, a 
pre-monocyte cell line, although TF-1, HL60, or KG1 can be used. 

To transiendy transfect U937 cells with the GAS/SEAP/Neo construct produced 
in Example 12, a DEAE-Dextran method (Kharbanda, et aL, Cell Growth & 
Differentiation, 5:259-265 (1994)) is used. First, harvest 2xl0 7 U937 cells and wash 
with PBS. The U937 cells are usually grown in RPMI 1640 medium containing 10% 
heat- inactivated fetal bovine serum (FBS) supplemented with 100 units/ml penicillin 
and 100 mg/ml streptomycin. 

Next, suspend the cells in 1 ml of 20 mM Tris-HCi (pH 7.4) buffer containing 
0.5 mg/ml DEAE-Dextran, 8 ug GAS-SEAP2 plasmid DNA, 140 mM NaCl, 5 mM 
KC1, 375 uM NajHPOJHjO, 1 mM MgCl 2 , and 675 uM CaCl,. Incubate at 37°C for 
45 min. 

Wash the cells with RPMI 1640 medium containing 10% FBS and then 
resuspend in 10 ml complete medium and incubate at 37"C for 36 hr. 

The GAS-SEAP/U937 stable cells are obtained by growing the cells in 400 
u.g/m1 G41 8. The G41 8-free medium is used for routine growth but every one to two 
months, the cells should be re-grown in 400 ug/ml G418 for couple of passages. 

These cells are tested by harvesting IxlO 8 cells (this is enough for ten 96-weli 
plates assay) and wash with PBS. Suspend the cells in 200 ml above described growth 
medium, with a final density of 5x10 s cells/ml. Plate 200 ul cells per well in the 
96-well plate (or IxlO 5 cells/well). 

Add 50 \il of the supernatant prepared by the protocol described in Example 1 1 . 
Incubate at 37'C for 48 to 72 hr. As a positive control, 100 U/ml interferon gamma can 
be used which is known to activate U937 cells. Over 30-fold induction is typically 
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observed in the positive control wells. SEAP assay the supernatant according to the 
protocol described in Example 17. 

Example 15: High-Throughput Screening Assay Identifying Neuronal 
Activity. 

When cells undergo differentiation and proliferation, a group of genes are 
activated through many different signal transduction pathways. One of these genes, 
EGR1 (early growth response gene 1), is induced in various tissues and cell types upon 
activation. The promoter of EGR 1 is responsible for such induction. Using the EGR1 
promoter linked to reporter molecules, activation of cells can be assessed by IL-21. 

Particularly, the following protocol is used to assess neuronal activity in PCI 2 
cell lines. PCI 2 cells (rat phenochromocytoma cells) are known to proliferate and/or 
differentiate by activation with a number of mitogens, such as TPA (tetradecanoyl 
phorbol acetate), NGF (nerve growth factor), and EGF (epidermal growth factor). The 
EGR1 gene expression is activated during this treatment. Thus, by stably transfecting 
PC 12 cells with a construct containing an EGR promoter linked to SEAP reporter, 
activation of PC 1 2 cells by IL-21 can be assessed. 

The EGR/SEAP reporter construct can be assembled by the following protocol. 
The EGR-1 promoter sequence (nucleotides -633 to +1; Sakamoto, K., et aL, 
Oncogene 6:867-871 (1991)) can be PCR amplified from human genomic DNA using 
the following primers: (A) 5' Primer: 5-GCG CTC GAG GGA TGA CAG CGA TAG 
AAC CCC GG-3* (SEQ ID NO: 18) and (B) 3' Primer: 5'-GCG AAG CTT CGC GAC 
TCC CCG GAT CCG CCT C-3' (SEQ TD NO: 19). 

Using the GAS:SEAP/Neo vector produced in Example 12, EGR1 amplified 
product can then be inserted into this vector. Linearize the GAS:SEAP/Neo vector 
using restriction enzymes Xho I and Hin dill, removing the GAS/SV40 stuffer 
fragment. Digest the EGR1 amplified product with the same enzymes. Li gate the 
vector and the EGR1 promoter. 

To prepare 96 well-plates for cell culture, 2 ml of a coating solution (1:30 
dilution of collagen type I (Upstate Biotech Inc. Cat#08-1 15) in 30% ethanol (filter 
sterilized)) is added per one 1 0 cm plate or 50 ml per well of the 96-well plate, and 
allowed to air dry for 2 hr. 

PCI 2 cells are routinely grown in RPMI-1640 medium (Bio Whittaker) 
containing 10% horse serum (JRH BIOSCIENCES, Cat. # 12449-78P), 5% 
heat-inactivated fetal bovine scrum (FBS) supplemented with 100 units/ml penicillin 
and 100 ug/ml streptomycin on a precoated 10 cm tissue culture dish. A 1:4 split is 
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done every three lo four days. Cells are removed from the plates by scraping and 
resuspended with pipetting up and down for more than 15 times. 

Transfect the EGR/SEAP/Neo construct into PC12 using the Lipofectamine 
protocol descrihed in Example 11. EGR-SEAP/PC12 stable cells are obtained by 
growing the cells in 300 tig/ml G418. The G418-free medium is used for routine 
growth but every one to two months, the cells should be re-grown in 300 ug/ml G41 8 
for several passages. 

To assay for neuronal activity, a 10 cm plate with cells around 70 to 80% 
confluent is screened by removing the old medium. Wash the cells once with PBS. 
Then starve the cells in low serum medium (RPMI-1640 containing 1% horse serum 
and 0.5% FBS with antibiotics) overnight. 

The next morning, remove the medium and wash the cells with PBS. Scrape 
off the cells from the plate, suspend the cells well in 2 ml low serum medium. Count 
the cell number and add more low serum medium to reach final cell density as 5x10 s 
cells/ml. 

Add 200 ul of the cell suspension to each well of 96-well plate (equivalent to 
lxlO 5 cells/well). Add 50 ul supernatant produced by Example 1 1 , 37° C for 48 to 72 
hr. As a positive control, a growth factor known to activate PC 12 cells through EGR 
can be used, such as 50 ng/ul of Neuronal Growth Factor (NGF). Over fifty-fold 
induction of SEAP is typically seen in the positive control wells. SEAP assay the 
supernatant according to Example 17. 

Example 16: Hiyh-Throughput Screening Assay for T-cell Activity 

NF-kB (Nuclear Factor kB) is a transcription factor activated by a wide variety 
of agents including the inflammatory cytokines IL-1 and TNF, CD30 and CD40, 
lymphotoxin-a and lymphotoxin-p, by exposure to LPS or thrombin, and by 
expression of certain viral gene products. As a transcription factor, NF-jcB regulates 
the expression of genes involved in immune cell activation, control of apoptosis (NF- 
kB appears to shield cells from apoptosis), B- and T-cell development, anti- viral and 
antimicrobial responses, and multiple stress responses. 

In non-stimulated conditions, NF- kB is retained in the cytoplasm with 1-icB 
(Inhibitor tcB). However, upon stimulation, I- kB is phosphoryiated and degraded, 
causing NF- kB to shuttle to the nucleus, thereby activating transcription of target 
genes. Target genes activated by NF- kB include IL-2, iL-6, GM-CSF, ICAM-1 and 
class 1 MHC. 
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Due to its central role and ability to respond to a range of stimuli, reporter 
constructs utilizing the NF-kB promoter element are used to screen the supernatants 
produced in Example 1 1 . Activators or inhibitors of NF-kB would be useful in treating 
diseases. For example, inhibitors of NF-kB could be used to treat those diseases 

5 related to the acute or chronic activation of NF-kB, such as rheumatoid arthritis. 

To construct a vector containing the NF-kB promoter element, a PCR based 
strategy is employed. The upstream primer contains four tandem copies of the NF-kB 
binding site (5'-GGG GAG TTT CCC-3'; SEQ ID NO:20), 1 8 bp of sequence 
complementary to the 5' end of the SV40 early promoter sequence, and is flanked with 

10 an Xho I site: 5'-GCG GCC TCG AGG GGA CTT TCC CGG GGA CTT TCC GGG 
GAG TTT CCG GGA CTT TCC ATC CTG CCA TCT CA A TTA G-3' (SEQ ID 
NO:21). 

^ The downstream primer is complementary to the 3* end of the SV40 promoter 

p and is flanked with a Hin dIU site: 5 -GCG GCA AGC TTT TTG CAA AGC CTA 

S3 15 GGC-3' (SEQ ID NO: 22). 

y PCR amplification is performed using the S V40 promoter template present in 

the pB-gal:promoter plasmid obtained from Clontech. The resulting PCR fragment is 
P digested with Xho I and Hin dm and subcloned into BLSK2- (Stratagene). 

p Sequencing with the T7 and T3 primers confirms the insert contains the following 

W 20 sequence: 

\SS 5 ' -CTCGAGGGGACTTTCCCGG^^ 

w TCCCTCCCCTAACIXXGCCCATCCCC^ 

ATTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAGCT 

25 caaaaagctt-3 • (SEQ TD NO:23) 

Next, replace the SV40 minimal promoter element present in the 
pSEAP2-promoter plasmid (Clontech) with this NF-kB/S V40 fragment using Xho I 
and Hin dill. However, this vector does not contain a neomycin resistance gene, and 
30 therefore, is not preferred for mammalian expression systems. 

In order to generate stable mammalian cell lines, the NF-kB/SV40/SEAP 
cassette is removed from the above NF-kB/SEAP vector using restriction enzymes Sal I 
and Not I, and inserted into a vector containing neomycin resistance. Particularly, the 
NF-kB/SV40/SEAP cassette was inserted into pGFP-1 (Clontech), replacing the GFP 
35 gene, after restricting pGFP-1 with Sal I and Not I. 

Once NF-xB/SV40/SEAP/Neo vector is created, stable Jurkat T-cclls are created 
and maintained according to the protocol described in Example 13. Similarly, the 
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method for assaying supernatants with these stable Jurkat T-cells is aiso described in 
Example 13. As a positive control, exogenous TNF-a (0.1 J, 10 ng) is added to wells 
H9, H10, and Hl l, with a 5-10 told activation typically observed. 

Example 17: Assay for SEAP Activity 

As a reporter molecule for the assays described in Examples 13-16, SEAP 
activity is assayed using the Tropix Phospho-light Kit (Cat. BP-400) according to the 
following general procedure. The Tropix Phospho-light Kit supplies the Dilution, 
Assay, and Reaction Buffers used beiow. 

Prime a dispenser with the 2.5x Dilution Buffer and dispense 15 ul of 2.5x 
dilution buffer into Optiplates containing 35 ul of a supernatant. Seal the plates with a 
plastic sealer and incubate at 65°C for 30 min. Separate the Optiplates to avoid uneven 
heating. 

Cool the samples to room temperature for 15 minutes. Empty the dispenser and 
prime with the Assay Buffer. Add 50 uJ Assay Buffer and incubate at room 
temperature 5 min. Empty the dispenser and prime with the Reaction Buffer (see the 
table below). Add 50 uJ Reaction Buffer and incubate at room temperature for 20 
minutes. Since the intensity of the chemilumincscenl signal is time dependent, and it 
takes about 10 minutes to read 5 plates on luminometer, one should treat 5 plates at each 
time and start the second set 10 minutes later. 

Read the relative light unit in the luminometer. Set 1 112 as blank, and print the 
results. An increase in chemiluminescence indicates reporter activity. 



Reaction Buffer Formulation: 



# of plates 


Rxn buffer diluent (ml) 


CSPD (ml) 


10 


60 


3 


11 


65 


3.25 


12 


70 


3.5 


13 


75 


3.75 


14 


80 


• 4 


15 


85 


4.25 


16 


. 90 


4.5 


17 


95 


4.75 


18 


100 


5 


19 


105 


5.25 


20 


110 


5.5 


21 


115 


5.75 


22 


120 


6 


23 


125 


6.25 


24 


130 


6.5 


25 


135 


6.75 


26 


140 


7 


27 


145 


7.25 
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28 


150 


7.5 


29 


155 


7.75 


30 


160 


8 


31 


165 


8.25 


32 


170 


8.5 


33 


175 


8.75 


34 


180 


9 


35 


185 


9.25 


36 


190 


9.5 


37 


195 


9.75 


38 


200 


10 


39 


205 


10.25 


40 


210 


10.5 


41 


215 


10.75 


42 


220 


11 


43 


225 


11.25 


44 


230 


11.5 


45 


235 


11.75 


46 


240 


12 


47 


245 


12.25 


48 


250 


12.5 


49 


255 


12.75 


50 


260 


13 



Example 18; High-Throughput Screening Assay Identifying Changes in 
Small Molecule Concentration and Memhrane Permeability 

Binding of a ligand to a receptor is known to alter intracellular levels of small 
molecules, such as calcium, potassium, sodium, and pH, as well as alter membrane 
potential. These alterations can be measured in an assay to identify supernatants which 
bind to receptors of a particular cell. Although the following protocol describes an 
assay for calcium, this protocol can easily be modified to detect changes in potassium, 
sodium, pH, membrane potential, or any other small molecule which is detectable by a 
fluorescent probe. 

The following assay uses Fluoromctric Imaging Plate Reader ("FL1PR") to 
measure changes in fluorescent molecules (Molecular Probes) that bind small 
molecules. Clearly, any fluorescent molecule detecting a small molecule can be used 
instead of the calcium fluorescent molecule, fluo-3, used here. 

For adherent cells, seed the cells at 10,000-20,000 cells/well in a Co-star black 
96-welI plate with clear bottom. The plate is incubated in a C0 2 incubator for 20 hours. 
The adherent cells are washed two times in Biotek washer with 200 ul of HBSS 
(Hank's Balanced Salt Solution) leaving 100 ul of buffer after the final wash. 

A stock solution of 1 mg/ml fluo-3 is made in 10% pluronic acid DMSO. To 
load the cells with fluo-3, 50 *d of 12 ^g/ml fluo-3 is added to each well. The plate is 
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incubated at 37 "C in a C0 2 incubator for 60 rnin. The plate is washed four times in the 
Biotek washer with HBSS leaving 100 ul of buffer. 

For non-adherent cells, the cells are spun down from culture media. Cells are 
re-suspended to 2-5xl0 6 ceils/ml with HBSS in a 50-ml conical tube. Four ul of 1 
5 mg/ml fluo-3 solution in 10% pluronic acid DMSO is added to each 1 ml of cell 

suspension. The tube is then placed in a 37°C water bath for 30-60 rnin. The cells are 
washed twice with HBSS, resuspended to lxlO 6 cells/ml, and dispensed into a 
microplate, 100 ui/well. The plate is centrifuged at 1000 rpm for 5 min. The plate is 
then washed once in Denley Cell Wash with 200 ul, followed by an aspiration step to 
10 100 ul final volume. 

For a non-cell based assay, each well contains a fluorescent molecule, such as 
fluo-3. The supernatant is added to the well, and a change in fluorescence is detected. 
□ To measure the fluorescence of intracellular calcium, the FI JPR is set for the 

9 following parameters: (1) System gain is 300-800 mW; (2) Exposure time is 0.4 

\j 1 5 second; (3) Camera F/stop is F/2; (4) Excitation is 488 nm; (5) Emission is 530 nm; and 

(6) Sample addition is 50 ul. Increased emission at 530 nm indicates an extracellular 
p signaling event caused by the a molecule, either IL-21 or a molecule induced by 1L-21, 

f which has resulted in an increase in the intracellular Ca 2+ concentration. 

O 

in 

[y 20 Example 19: High- Throughput Screening Assay Identifying Tyrosine 

v0 Kinase Activity 

J3 

jg The Protein Tyrosine Kinases (PTK) represent a diverse group of 

transmembrane and cytoplasmic kinases. Within the Receptor Protein Tyrosine Kinase 
RPTK) group are receptors for a range of mitogenic and metabolic growth factors 

25 including the PDGF, FGF, EGF, NGF, HGF and Insulin receptor subfamilies. In 
addition there are a large family of RFHCs for which the corresponding ligand is 
unknown. Ligands for RPTKs include mainly secreted small proteins, but also 
membrane-bound and extracellular matrix proteins. 

Activation of RPTK by ligands involves ligand-mediated receptor dimerization, 

30 resulting in transphosphorylation of the receptor subunits and activation of the 
cytoplasmic tyrosine kinases. The cytoplasmic tyrosine kinases include receptor 
associated tyrosine kinases of the sre-family (e.g., sre, yes, lck t lynjyn) and 
non-receptor linked and cytosolic protein tyrosine kinases, such as the Jak family, 
members of which mediate signal transduction triggered by the cytokine superfamily .of 

35 receptors (e.g., the Tnterleukins, Interferons, GM-CSF, and Leptin). 
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Because of the wide range of known factors capable of stimulating tyrosine 
kinase activity, identifying whether 1L-21 or a molecule induced by IL-21 is capable of 
activating tyrosine kinase signal transduction pathways is of interest. Therefore, the 
following protocol is designed to identify such molecules capable of activating the 
tyrosine kinase signal transduction pathways. 

Seed target cells (e.g., primary keratinocytes) at a density of approximately 
25,000 cells per well in a 96 well Loprodyne Silent Screen Plates purchased from 
Nalge Nunc (Naperville, IL). The plates are sterilized with two 30 minute rinses with 
100% ethanol, rinsed with water and dried overnight. Some plates are coated for 2 hr 
with 100 ml of cell culture grade type T collagen (50 mg/ml), gelatin (2%) or polylysine 
(50 mg/ml), all of which can be purchased from Sigma Chemicals (St. Louis, MO) or 
10% Matrigel purchased from Becton Dickinson (Bedford,MA), or calf serum, rinsed 
with PBS and stored at 4°C. Cell growth on these plates is assayed by seeding 5,000 
cells/well in growth medium and indirect quantitation of cell number through use of 
alamar Blue as described by the manufacturer Alamar Biosciences, Inc. (Sacramento, 
CA) after 48 hr. Falcon plate covers #3071 from Becton Dickinson (Bedford,MA) are 
used to cover the Loprodyne Silent Screen Plates. Falcon Microtest 111 cell culture 
plates can also be used in some proliferation experiments. 

To prepare extracts, A431 cells are seeded onto the nylon membranes of 
Loprodyne plates (20,000/200ml/well) and cultured overnight in complete medium. 
Cells are quiesced by incubation in serum-free basal medium for 24 hr. After 5-20 
minutes, treatment with EGF (60ng/ml) or 50 ul of the supernatant produced in 
Example 1 1, the medium was removed and 100 ml of extraction buffer ((20 mM 
HEPES pH 7.5, 0.15 M NaCl, 1% Triton X-100, 0.1% SDS, 2 mM Na3V04, 2 mM 
Na4P207 and a cocktail of protease inhibitors (# 1836170) obtained from Boeheringer 
Mannheim (Indianapolis, IN) is added to each well and the plate is shaken on a rotating 
shaker for 5 minutes at 4 0 C. The plate is then placed in a vacuum transfer manifold and 
the extract filtered through the 0.45 mm membrane bottoms of each well using house 
vacuum. Extracts are collected in a 96-well catch/assay plate in the bottom of the 
vacuum manifold and immediately placed on ice. To obtain extracts clarified by 
centrifugation, the content of each well, after detergent solubilization for 5 minutes, is 
removed and centrifuged for 15 minutes at 4*C at 16,000 x g. 

Test the filtered extracts for levels of tyrosine kinase activity. Although many 
methods of detecting tyrosine kinase activity are known, one method is described here. 

Generally, the tyrosine kinase activity of a supernatant is evaluated by 
determining its ability to phosphorylate a tyrosine residue on a specific substrate (a 
biotinylated peptide). Biolinylated peptides that can be used for this purpose include 
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PSK1 (corresponding to amino acids 6-20 of the cell division kinase cdc2-p34) and 
PSK2 (corresponding to amino acids 1-17 of gastrin). Both peptides are substrates for 
a range of tyrosine kinases and are available from Boehringer Mannheim. 

The tyrosine kinase reaction is set up by adding the following components in 
order. First, add 10 ul of 5 uM Biotinylated Peptide, then 10 ul ATP/Mg 1 " (5 mM 
ATP/50 mM MgCl 2 ), then 10 ul of 5x Assay Buffer (40 mM imidazole hydrochloride, 
pH 73, 40 mM p-glycerophosphate, 1 mM EGTA, 100 mM MgCl 2 , 5 mM MnCl 2 , 0.5 
mg/ml BSA), then 5 ui of Sodium Vanadate (1 mM), and then 5 ul of water. Mix the 
components gently and preincubate the reaction mix at 30*C for 2 min. Initial the 
reaction by adding 10 ul of the control enzyme or the filtered supernatant. 

The tyrosine kinase assay reaction is then terminated by adding 10 ul of 120 mm 
EDTA and place the reactions on ice. 

Tyrosine kinase activity is determined by transferring 50 ul aliquot of reaction 
mixture to a microtiter plate (MTP) module and incubating at 37°C for 20 min. This 
allows the streptavadin coated 96 well plate to associate with the biotinylated peptide. 
Wash the MTP module with 300 ul/well of PBS four times. Next add 75 ul of 
anti-phospotyrosine antibody conjugated to horse radish peroxidase(anti-P-Tyr-POD 
(0.5 ul/ml)) to each well and incubate at 37"C for one hour. Wash the well as above. 

Next add 100 ul of peroxidase substrate solution (Boehringer Mannheim) and 
incubate at room temperature for at least 5 min (up to 30 min). Measure the absorbance 
of the sample at 405 nm by using EL1SA reader. The level of bound peroxidase activity 
is quantitated using an ELIS A reader and reflects the level of tyrosine kinase activity. 

Example 20: High-Throughput Screening Assay Identifying 
Phosphorylation Activity 

As a potential alternative and/or compliment to the assay of protein tyrosine 
kinase activity described in Example 19, an assay which detects activation 
(phosphorylation) of major intracellular signal transduction intermediates can also be 
used. For example, as described below one particular assay can detect tyrosine 
phosphorylation of the Erk-1 and Erk-2 kinases. However, phosphorylation of other 
molecules, such as Raf, JNK, p38 MAP, Map kinase kinase (MEK), MEK. kinase, 
Src, Muscle specific kinase (MuSK), IRAK, Tec, and Janus, as well as any other 
phosphoserine, phosphotyrosine, or phosphothreonine molecule, can be delected by 
substituting these molecules for Erk-1 or Erk-2 in the following- assay. 

Specifically, assay plates are made by coating the wells of a 96- well ELIS A 
plate with 0. 1 ml of protein G (1 ug/ml) for 2 hr at room temp (RT). The plates are 
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then rinsed with PBS and blocked with 3% BSA/PBS for 1 hr at RT. The protein G 
plates arc then treated with 2 commercial monoclonal antibodies (100 ng/well) against 
Erk-1 and Erk-2 (1 hr at RT; available from Santa Cruz Biotechnology). To detect 
other molecules, this step can easily be modified by substituting a monoclonal antibody 
delecting any of the above described molecules. After 3-5 rinses with PBS, the plates 
are stored at 4*C until use. 

A431 cells are seeded at 20,000/well in a 96-well Loprodyne filterplate and 
cultured overnight in growth medium. The cells are then starved for 48 hr in basal 
medium (DMEM) and then treated with EGF (6 ng/well) or 50 ul of the supernatants 
obtained in Example 1 1 for 5-20 minutes. The cells are then solubilized and extracts 
filtered direcdy into the assay plate. 

After incubation with the extract for 1 hr at RT, the wells are again rinsed. As a 
positive control, a commercial preparation of MAP kinase (10 ng/well) is used in place 
of A431 extract. Plates are then treated with a commercial polyclonal (rabbit) antibody 
(1 ug/ml) which specifically recognizes the phosphorylated epitope of the Erk-1 and 
Erk-2 kinases (1 hr at RT). This antibody is biotinylatcd by standard procedures. The 
bound polyclonal antibody is then quantitated by successive incubations with 
Europium-streptavidin and Europium fluorescence enhancing reagent in the Waliac 
DELFIA instrument (time-resolved fluorescence). An increased fluorescent signal over 
background indicates a phosphorylation by EL-21 or a molecule induced by IL-21. 

Example 21: Method of Determining Alterations in the IL-21 Gene 

RNA isolated from entire families or individual patients presenting with a 
phenotype of interest (such as a disease) is be isolated. cDN A is then generated from 
these RNA samples using protocols known in the art (see, Sambrook, et ai, supra) 
The cDNA is then used as a template for PCR, employing primers surrounding regions 
of interest in SEQ ID NO: 1. Suggested PCR conditions consist of 35 cycles at 95"C 
for 30 seconds; 60-120 seconds at 52-58 8 C; and 60-120 seconds at 70*C, using buffer 
solutions described by Sidransky and colleagues (Science 252:706 (1991)). 

PCR products are then sequenced using primers labeled at the 5' end with T4 
polynucleotide kinase, employing SequiTherm Polymerase (Epicentre Technologies). 
The intron^exon borders of selected exons of TL-21 are also determined and genomic 
PCR products analyzed to confirm the results. PCR products harboring suspected 
mutations in IL-21 are then cloned and sequenced to validate the results of the direct 
sequencing. 

PCR products of IL-21 arc cloned into T-tailed vectors as described by Holton 
and Graham (Nucl Acids Res. 19:1 156 (1991)) and sequenced with T7 polymerase 
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(United States Biochemical). Affected individuals are identified by mutations in IL-21 
not present in unaffected individuals. 

Genomic rearrangements are also observed as a method of determining 
alterations in the IL-21 gene. Genomic clones isolated according to Example 2 are 
nick-translated with digoxigenindeoxy-uridine 5-triphosphate (Boehringer Manheim), 
and FISH performed as described by Johnson and coworkers (Methods Cell Biol. 
35:73-99 (1991)). Hybridization with the labeled probe is carried out using a vast 
excess of human cot-1 DNA for specific hybridization to the IL-21 genomic locus. 

Chromosomes are counterstained with 4,6-diamino-2-phcnylidole and 
propidium iodide, producing a combination of C- and R-bands. Aligned images for 
precise mapping are obtained using a triple-band filter set (Chroma Technology, 
Brattleboro, VT) in combination with a cooled charge-coupled device camera 
(Photometries, Tucson, A2) and variable excitation wavelength filters (Johnson, C, et 
al, Genet. Anal Tech. Appi 8:75 (1991)). linage collection, analysis and 
chromosomal fractional length measurements are performed using the ISee Graphical 
Program System. (Inovision Corporation, Durham, NC). Chromosome alterations of 
the genomic region of IL-21 (hybridized by the probe) are identified as insertions, 
deletions, and translocations. These TL-21 alterations are used as a diagnostic marker 
for an associated disease. 

Example 22: Method of Detecting Abnormal Levels of IL-21 in a 
Biological Sample 

IL-21 polypeptides can be detected in a biological sample, and if an increased or 
decreased level of IL-21 is detected, this polypeptide is a marker for a particular 
phenotypc. Methods of detection are numerous, and thus, it is understood that one 
skilled in the art can modify the following assay to fit their particular needs. 

For example, antibody-sandwich ELISAs are used to detect IL-21 in a sample, 
preferably a biological sample. Wells of a microtitcr plate are coated with specific 
antibodies to IL-21 r at a final concentration of 0.2 to 10 ug/ml. The antibodies are 
either monoclonal or polyclonal and are produced by the method described in Example 
10. The wells are blocked so that non-specific binding of IL-21 to the well is reduced. 

The coated wells are then incubated for > 2 hours at RT with a sample 
containing TL-21. Preferably, serial dilutions of the sample should be used to validate 
results. The plates are then washed three times with de ionized or distilled water to 
remove unbounded IL-2 1 . 

Next, 50 ul of specific antibody-alkaline phosphatase conjugate/at a 
concentration of 25-400 ng, is added and incubated for 2 hours at room temperature. 
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The plates are again washed three times with deionized or distilled water to remove 
unbounded conjugate. 

Add 75 ul of 4-methylumbelliferyl phosphate (MUP) or p-nitrophenyl 
phosphate (NPP) substrate solution to each well and incubate 1 hour at room 
temperature. Measure the reaction by a microti ter plate reader. Prepare a standard 
curve, using serial dilutions of a control sample, and plot IL-21 polypeptide 
concentration on the X-axis (log scale) and fluorescence or absorbance of the Y-axis 
(linear scale). Interpolate the concentration of the IL-21 in the sample using the 
standard curve. 

Example 23: Formulating a Polypeptide 

The IL-21 composition will be formulated and dosed in a fashion consistent 
with good medical practice, taking into account the clinical condition of the individual 
patient (especially the side effects of treatment with the IL-21 polypeptide alone), the 
site of delivery, the method of administration, the scheduling of administration, and 
other factors known to practitioners. The "effective amount" for purposes herein is 
thus determined by such considerations. 

As a general proposition, the total pharmaceutical Iy effective amount of IL-21 
administered parenterally per dose will be in the range of about 1 ug/kg/day to 10 
mg/kg/day of patient body weight, although, as noted above, this will be subject to 
therapeutic discretion. More preferably, this dose is at least 0.01 mg/kg/day, and most 
preferably for humans between about 0.01 and 1 mg/kg/day for the hormone. Tf given 
continuously, IL-21 is typically administered at a dose rate of about 1 ug/kg/hour to 
about 50 u,g/kg/hour, either by 1-4 injections per day or by continuous subcutaneous 
infusions, for example, using a mini-pump. An intravenous bag solution may also be 
employed. The length of treatment needed to observe changes and the interval 
following treatment for responses to occur appears to vary depending on the desired 
effect. 

Pharmaceutical compositions containing IL-21 are administered orally, rectally, 
parenterally, intracistemally, intravaginally, intraperitoneal ly, topically (as by powders, 
ointments, gels, drops or transdermal patch), bucally, or as an oral or nasal spray. 
"Pharmaceutically acceptable carrier" refers to a non-toxic solid, semisolid or liquid 
filler, diluent, encapsulating material or formulation auxiliary of any type. The term 
"parenteral" as used herein refers to modes of administration which include 
intravenous, intramuscular, intraperitoneal, intrasternal, subcutaneous and intraarticular 
injection and infusion. 
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IL-21 is also suitably administered by sustained- release systems. Suitable 
examples of sustained-release compositions include semi-permeable polymer matrices 
in the form of shaped articles, e.g., films, or mirocapsules. Sustained-release matrices 
include polylactides (U.S. Pat. No. 3,773,919, EP 58,481), copolymers of L-glutamic 
acid and gamma-ethyl-L-g!utamate (Sidman, U., etal, Biopolymers 22:547-556 
(1983)), poly (2- liydroxyethyl methacrylate) (Langer, R., et aL, J. Biomed. Mater. 
Res. 15:167-277 (1981); Langer, R. Chem. Tech. 12:98-105 (1982)), ethylene vinyl 
acetate (Langer, R. ( et aL) or poly-D- (-)-3-hydroxybutyric acid (EP 133,988). 
Sustained-release compositions also include liposomally entrapped TL-21 polypeptides. 
Liposomes containing the IL-21 are prepared by methods known per se (DE 3,21 8,121 ; 
Epstein, et aL, Proc. Natl. Acad. Sci. USA 82:3688-3692 (1985); Hwang, et aL, 
Proc. Natl. Acad. Sci. USA 77:4030-4034 (1980); EP 52,322; EP 36,676; EP 88,046; 
EP 143,949; EP 142,641; Japanese Pat. Appl. 83-1 18008; U.S. Pat. Nos. 4,485,045 
and 4,544,545; and EP 102,324). Ordinarily, the liposomes are of the small (about 
200-800 Angstroms) unilamellar type in which the lipid content is greater than about 30 
mol. percent cholesterol, the selected proportion being adjusted for the optimal secreted 
polypeptide therapy. 

For parenteral administration, in one embodiment, IL-21 is formulated generally 
by mixing it at the desired degree of purity, in a unit dosage injectable form (solution, 
suspension, or emulsion), with a pharmaceutically acceptable carrier, i.e., one that is 
non-toxic to recipients at the dosages and concentrations employed and is compatible 
with other ingredients of the formulation. For example, the formulation preferably does 
not include oxidizing agents and other compounds that are known to be deleterious to 
polypeptides. 

Generally, the formulations are prepared by contacting TL-21 uniformly and 
intimately with liquid carriers or finely divided solid carriers or both. Then, if 
necessary, the product is shaped into the desired formulation. Preferably the carrier is a 
parenteral carrier, more preferably a solution that is isotonic with the blood of the 
recipient. Examples of such carrier vehicles include water, saline, Ringer's solution, 
and dextrose solution. Non-aqueous vehicles such as fixed oils and ethyl oleate are 
also useful herein, as well as liposomes. 

The carrier suitably contains minor amounts of additives such as substances that 
enhance isotonicity and chemical stability. Such materials are non-toxic to recipients at 
the dosages and concentrations employed, and include buffers such as phosphate, 
citrate, succinate, acetic acid, and other organic acids or their salts; antioxidants such as 
ascorbic acid; low molecular weight (less than about ten residues) polypeptides, e.g., 
polyarginine or tripeptides; proteins, such as serum albumin, gelatin, or 



immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids, 
such as glycine, glutamic acid, aspartic acid, or arginine; monosaccharides, 
disaccharides, and other carbohydrates including cellulose or its derivatives, glucose, 
manose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or 
sorbitol; counterions such as sodium; and/or nonionic surfactants such as polysorbates, 
poloxamers, or PEG. 

IL-21 is typically formulated in such vehicles at a concentration of about 0. 1 
mg/ml to 100 mg/ml, preferably 1-10 mg/mi, at a pH of about 3 to 8. It will be 
understood that the use of certain of the foregoing excipients, carriers, or stabilizers will 
result in the formation of polypeptide salts. 

IL-21 used for therapeutic administration can be sterile. Sterility is readily 
accomplished by filtration through sterile filtration membranes (e.g., 0.2 micron 
membranes). Therapeutic polypeptide compositions generally are placed into a 
container having a sterile access port, for example, an intravenous solution bag or vial 
having a stopper pierceablc by a hypodermic injection needle. 

IL-21 polypeptides ordinarily will be stored in unit or multi-dose containers, for 
example, sealed ampoules or vials, as an aqueous solution or as a lyophilized 
formulation for reconstitution. As an example of a lyophilized formulation, 10-ml vials 
are filled with 5 ml of sterile-filtered 1% (w/v) aqueous IL-21 polypeptide solution, and 
the resulting mixture is lyophilized. The infusion solution is prepared by reconstituting 
the lyophilized IL-21 polypeptide using bacteriostatic Water-For-Injection (WFJ). 

The invention also provides a pharmaceutical pack or kit comprising one or 
more containers filled with one or more of the ingredients of the pharmaceutical 
compositions of the invention. Associated with such container(s) can be a notice in the 
form prescribed by a governmental agency regulating the manufacture, use or sale of 
pharmaceuticals or biological products, which notice reflects approval by the agency of 
manufacture, use or sale for human administration. In addition, IL-21 may be 
employed in conjunction with other therapeutic compounds. 

Example 24: Method of Treating Decreased Levels of IL-21 

The present invention relates to a method for treating an individual in need of a 
decreased level of IL-21 activity in the body comprising, administering to such an 
individual a composition comprising a therapeutically effective amount of IL-21 
antagonist. Preferred antagonists for use in the present invention are IL-2 1 -specific 
antibodies. 

Moreover, it will be appreciated that conditions caused by a decrease in the 
standard or normal expression level of IL-21 in an individual can be treated by 
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administering EL-21, preferably in the secreted form. Thus, the invention also provides 
a method of treatment of an individual in need of an increased level of IL-21 
polypeptide comprising administering to such an individual a pharmaceutical 
composition comprising an amount of 1L-21 to increase the activity level of IL-21 in 
5 such an individual. 

For example, a patient with decreased levels of IL-2 1 polypeptide receives a 
daily dose 0.1-100 ug/kg of the polypeptide for six consecutive days. Preferably, the 
polypeptide is in the secreted form. The exact details of the dosing scheme, based on 
administration and formulation, are provided in Example 23. 

10 

Example 25: Method of Treating Inc reased Levels of IL-21 

The present invention also relates to a method for treating an individual in need 
of an increased level of IL-21 activity in the body comprising administering to such an 
individual a composition comprising a therapeutically effective amount of IL-21 or an 
15 agonist thereof. 

Anlisense technology is used to inhibit production of IL-21. This technology is 
one example of a method of decreasing levels of TL-21 polypeptide, preferably a 
secreted form, due to a variety of etiologies, such as cancer. 

For example, a patient diagnosed with abnormally increased levels of TL-21 is 
20 administered intravenously antisense polynucleotides at 0.5, 1.0, 1.5, 2.0 and 3.0 
mg/kg day for 21 days. This treatment is repeated after a 7-day rest period if the 
treatment was well tolerated. The formulation of the and sense polynucleotide is 
provided in Example 23. 

25 Example 26: Method of Treatment Using Gene Therapy 

One method of gene therapy transplants fibroblasts, which are capable of 
expressing IL-21 polypeptides, onto a patient. Generally, fibroblasts are obtained from 
a subject by skin biopsy. The resulting tissue is placed in tissue-culture medium and 
separated into small pieces. Small chunks of the tissue arc placed on a wet surface of a 

30 tissue culture flask, approximately ten pieces are placed in each flask. The flask is 
turned upside down, closed tight and left at room temperature over night. After 24 
hours at room temperature, the flask is inverted and the chunks of tissue remain fixed to 
the bottom of the flask and fresh media (e.g., Ham's F12 media, with 10% FRS, 
penicillin and streptomycin) is added. The flasks are then incubated at 3TC for 

35 approximately one week. 



84 



At this time, fresh media is added and subsequently changed every several days. 
After an additional two weeks in culture, a monolayer of fibroblasts emerge. The 
monolayer is trypsinized and scaled into larger flasks. 

pMV-7 (Kirschmeier, P.T., et al, DNA 7:219-25 (1988)), flanked by the long 
terminal repeats of the Moloney murine sarcoma virus, is digested with Eco RT and Hin 
din and subsequently treated with calf intestinal phosphatase. The linear vector is 
fractionated on agarose gel and purified, using glass beads. 

The cDNA encoding 1L-21 can be amplified using PGR primers which 
correspond to the 5' and 3' end sequences respectively as set forth in Example I . 
Preferably, the 5' primer contains an Eco RI site and the 3' primer includes a Hin dill 
site. Equal quantities of the Moloney murine sarcoma virus linear backbone and the 
amplified Eco RI and Hin dEQ fragment are added together, in the presence of T4 DNA 
ligase. The resulting mixture is maintained under conditions appropriate for ligation of 
the two fragments. The ligation mixture is then used to transform bacteria HB 101, 
which are then plated onto agar containing kanamycin for the purpose of confirming 
that the vector contains properly inserted 1L-21 . 

The amphotropic pA3 17 or GP+aml2 packaging cells are grown in tissue 
culture to confluent density in Dulbecco's Modified Eagles Medium (T>MEM) with 10% 
calf serum (CS), penicillin and streptomycin. The MSV vector containing the 1L-21 
gene is then added to the media and the packaging cells transduced with the vector. The 
packaging cells now produce infectious viral particles containing the TL-21 gene (the 
packaging cells are now referred to as producer cells). 

Fresh media is added to the transduced producer cells, and subsequently, the 
media is harvested from a 10 cm plate of confluent producer cells. The spent media, 
containing the infectious viral particles, is filtered through a millipore filter to remove 
detached producer cells and this media is then used to infect fibroblast cells. Media is 
removed from a sub-confluent plate of fibroblasts and quickly replaced with the media 
from the producer cells. This media is removed and replaced with fresh media. If the 
titer of virus is high, then virtually all fibroblasts will be infected and no selection, is 
required. If the titer is very low, then it is necessary to use a retroviral vector that has a 
selectable marker, such as neo or his. Once the fibroblasts have been efticiendy 
infected, the fibroblasts are analyzed to determine whether IL-21 protein is produced. 

The engineered fibroblasts are then transplanted onto the host, either alone or 
after having been grown to confluence on cytodex 3 microcarrier beads. 

It will be clear that the invention may be practiced otherwise than as particularly 
described in the foregoing description and examples. Numerous modifications and 
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variations of the present invention are possible in light of the above teachings and, 
therefore, are within the scope of the appended claims. 

The entire disclosure of each document cited (including patents, patent 
applications, journal articles, abstracts, laboratory manuals, books, or other 
disclosures) in the Background of the Invention, Detailed Description, and Examples 
hereby incorporated herein by reference. 
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Sequence Listing 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Ebner, Reinhard 
Ruben, Steven M. 

(ii) TITLE OF INVENTION: Interleukins 21 and 22 

(iii) NUMBER OF SEQUENCES: 27 

<iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Human Genome Sciences, Inc. 

(B) STREET: 9410 Key West Avenue 

(C) CITY: Rockville 

(D) STATE: MD 

(E) COUNTRY : US 

(F) ZIP: 20850 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
{B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA : 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Brookes, A. Anders 

(B) REGISTRATION NUMBER: 36,373 

(C) REFERENCE/DOCKET NUMBER: PF470PP 

<ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 301-309-8504 

(B) TELEFAX: 301-309-8439 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 703 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNE S S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA {genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
GGCACGAGTG OACACGGATG AGGACCGCTA TCCACAGAAG CTGGCCTTCG CCGAGTOCCT 60 
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GTGCAGAGGC TGTATCGATG CACGGACGGG CCGCGAGACA GCTGCGCTCA ACTCCGTGCG 120 

GCTGCTCCAG AGCCTGCTGG TGCTGCGCCG CCGGCCCTGC TCCCGCGACG GCTCGGGGCT 180 

CCCCACACCT GGGGCCTTTG CCTTCCACAC CX1AGTTCATC CACGTCCCCG TCGGCTGCAC 240 

CTGCGTGCTG CCCCGTTCAG TGTGACCCCC AAGGCCGTGG GGCCCTTAGA CTGGACACGT 300 

GTGCTCCCCA GAGGGCACCC CCTATTTATG TGTATTTATT GTTATTTATA 1GCCTCCCCC 360 

AACACTACCC TTGGGGTCTG GGCATTCCCC GTGTCTGGAG GACAGCCCCC CACTGTTCTC 420 

CTCATCTCCA GCCTCAGTAG TTGGGGGTGA AGGAGCTCAG CACCTCTTCC AGCCCTTAAA 480 

GCTGCAGAAA AGGTGTCACA CGGCTGCCTG TACCTTGGTC CCTGTCCTGC TCCCGGCITC 540 

CCTTACCCTA TCACTGGCCT CAGGCCCCCG CAGGCTGCCT CTTCCCAACC TCCTTGGAAG 600 

TACCCCTGTT TCTTAAACAA TTATTTAAGT GTACGTGTAT TATTAAACTG A1X3AACACAA 660 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 703 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRI PTTON ; SEQ ID NO: 2; 

Ala Arg Val Asp Thr Asp Glu Asp Arg Tyr Pro Gin Lys Leu Ala Phe 
15 10 15 

Ala Glu Cys Leu Cys Arg Gly Cys He Asp Ala Arg Thr Gly Arg Glu 
20 25 30 

Thr Ala Ala Leu Asn Ser Val Arg Leu Leu Gin Ser Leu Leu Val Leu 
35 40 45 

Arg Arg Arg Pro Cys Ser Arg Asp Gly Ser Gly Leu Pro Thr Pro Gly 
50 55 60 

Ala Phe Ala Phe His Thr Glu Phe He His Val Pro Val Gly Cys Thr 
65 70 75 80 

Cys Val l«u Pro Arg Ser Val 
85 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : 

<M LENGTH; 1642 base pairs 
(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 
<D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GGAATTCGGC ACGAGCTCGT GCCGTGCTCA GTGCCTTCCA CCACACGCIX3 CAGCTGGGGC 60 

CGCGTGAGCA GGCGCGCAAC GCGAGCTGCC CGGCAGGGGG CAGGCCCGCC GACOGCCGCT 120 

TCCGGCCGCC CACCAACCTG CGCAGCGTGT CGCCCTGGGC CTACAGAATC TCCTACGACC 180 

CGGCGAGGTA CCCCAGGTAC CTGCCTGAAG CCTACTGCCT GTGCCGGGGC TGCCTGACCG 240 

GGCTGTTCGG CGAGGAGGAC GTGCGCTTCC GCAGCGCCCC TGTCTACATG CCCACCGTCG 300 

01 TCCTGCGCCG CACCCCCGCC TGCGCCGGCG GCCGTTCCGT CTACACCGAG GCCTACGTCA 360 



Q CCATCCCCGT GGGCTGCACC TGCGTCCCCG AGCCGGAGAA GGACGCAGAC AGCATCAACT 420 

m 

Si 
uJ 

Q GAGCCGCCTG GAGGGCTCGG TCGGCGACCT CTGAAGAGAG TGCACCGAGC AAACCAAGTG 600 
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CCAGCATCGA CAAACAGGGC GCCAAGCTCC TGCTGGGCCC CAACGACGCG CCCGCTGGCC 480 
CCTGAGGCCG GTCCTGCCCC GGGAGGTCTC CCCGGCCCGC ATCCCGAGGC GCCCAAGCTG S40 



Q CCGGAGCACC AGCGCCGCCT TTCCATGGAG ACTCGTAAGC AGCTTCATCT GACAOGGGCA 660 

U1 



tccctggctt GCTTTTAGCT ACAAGCAAGC AGCGTGGCTG GAAGCTGATG GGAAACGACC 720 

CGGCACGGGC ATCCTGTGTG CGGCCCGCAT GGAGGGTTTG GAAAAGTTCA CGGAGGCTCC 780 

CTGAGGAGCC TCTCAGATCG GCTGCTGCGG GTGCAGGGCG TGACTCACCG CTGGGTGCTT 840 

GCCAAAGAGA TAGGGACGCA TATGCTTTTT AAAGCAATCT AAAAATAATA ATAAGTATAG 900 

CGACTATATA CCTACTTTTA AAATCAACTG TTTTGAATAG AGGCAGAGCT ATTTTATATT 960 

ATCAAATGAG AGCTACTCTG TTACATTTCT TAACATATAA ACATCGTTTT TTACTTCTTC 1020 

TGGTAGAATT TTTTAAAGCA TAATTGGAAT CCTTGGATAA ATTTTGTAGC TGGTACACTXJ 1080 

TGGCCTGGGT CTCTGAATTC AGCCTGTCAC CGATGGCTGA CTGATGAAAT GGACACGTCT 1140 

CATCTGACCC ACTCTTCCTT CCACTGAAGG TCTTCACGGG CCTCCAGGTG GACCAAAGGG 1200 

ATGCACAGGC GGCTCGCATC CCCCAGGGCC AGCTAAGAGT TCCAAAGATC TCAGATTTGG 1260 

TTTTAGTCAT GAATACATAA ACAGTCTCAA ACTCGCACAA TTTTTTCCCC CriTTGAAAG 1320 

" CCACTCQGGC CAATTTGTCG TTAAGAOGTG GTGAGATAAG AAGTGGAACG TGACATCTTT 1380 

GCCAGTTGTC AGAAGAATCC AAGCAGGTAT TGGCTTAGTT GTAAGGGCTT TAGGATCAGG 1440 

CTGAATATGA GGACAAAGTG GGCCACGTTA GCATCTGCAG AGATCAATCT GGAGGCTTCT 1500 

GTTTCTGCAT TCTGCCACGA GAGCTAGGTC CTTGATCTTT TCTTTAGATT GAAAGTC1X3T 1560 
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CTCTGAACAC AATTAITTGT AAAAGTTAGT AGTTCTTTTT TAAATCATTA AAAGAGGCTT 1620 
GCTGAAAAAA AAAAAAAAAA AA 1642 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



{xi> SEQUENCE DESCRIPTION : SEQ ID NO: 4: 

Asn Ser Ala Arg Ala Arg Ala Val Leu Ser.Ala Phe His His Thr I»eu 
1.5 10 15 

Gin Leu Gly Pro Arg Glu Gin Ala Arg Asn Ala Ser Cys Pro Ala Gly 
20 25 30 

Gly Arg Pro Ala Asp Arg Arg Phe Arg Pro Pro Thr Asn Leu Arg Ser 
35 40 45 

Val Ser Pro Trp Ala Tyr Arg lie Ser Tyr Asp Pro Ala Arg Tyr Pro 
50 55 60 

Arg Tyr Leu Pro Glu Ala Tyr Cys Leu Cys Arg Gly Cys Leu Thr Gly 
65 70 75 80 

Leu Phe Gly Glu Glu Asp Val Arg Phe Arg Ser Ala Pro Val Tyr Mec 
85 90 95 

Pro Thr Val Val Leu Arg Arg Thr Pro Ala Cys Ala Gly Gly Arg Ser 
100 105 110 

Val Tyr Thr Glu Ala Tyr Val Thr lie Pro Val Gly Cys Thr Cys Val 
115 120 125 

Pro Glu Pro Glu Lys Asp Ala Asp Ser lie Asn Ser Ser He Asp Lys 
130 135 . 140 

Gin Gly Ala Lys Leu Leu Leu Gly Pro Asn Asp Ala Pro Ala Gly Pro 
145 150 155 160 



(2) INFORMATION FOR SEQ ID NO: 5: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 
(R) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(11) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Thr Pro Gly Lys Thr Ser Leu Val Ser Leu Leu Leu Leu Leu Ser 
15 10 15 

Leu Glu Ala He Val Lys Ala Gly Tie Thr He Pro Arg Asn Pro Gly 
20 25 30 

cys Pro Asn Ser Glu Asp Lys Asn Phe Pro Arg Thr Val Met Val Asn 
35 40 45 

Leu Asn He His Asn Arg Asn Thr Asn Thr Asn Pro Lys Arg Ser Ser 
50 55 60 

Asp Tyr Tyr Asn Arg Ser Thr Ser Pro Trp Asn Leu His Arg Asn Glu 
65 70 75 80 

Asp Pro Glu Arg Tyr Pro Ser Val He Trp Glu Ala Lys Cys Atg His 
85 90 95 

Leu Gly Cys He Asn Ala Asp Gly Asn Val Asp Tyr His Met Asn Ser 
100 105 110 

Val Pro He Gin Gin Glu He Leu Val Leu Arg Arg Glu Pro Pro His 
115 120 125 

Cys Pro Asn Ser Phe Arg Leu Glu Lys He Leu Val Ser Val G]y Cys 
130 135 140 

Thr Cys Val Thr Pro He Val His His Val Ala 
145 150 155 

(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 

<C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



{xi) SEQUENCE DESCRIPTION : SEQ ID NO: 6; 

Met Ser Pro Gly Arg Ala Ser Ser Val Ser Leu Met Leu Leu Leu Leu 
1 5 10 15 

Leu Ser Leu Ala Ala Thr Vol Lys Ala Ala Ala He He Pro Gin Ser 
20 25 30 

Ser Ala Cys Pro Asn Thr Glu Ala Lys Asp Phe Leu Gin Asn Val Lys 
35 40 45 



Val Asn Lou Lys Val Phe Asn Ser Leu Gly Ala Lys Vol Ser Sor Arg 
50 55 60 
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Arg Pro Ser Asp Tyr Leu Asn Arg Ser Thr Ser Pro Trp Thr Leu His 
65 70 75 80 

Arg Asn Glu Asp Pro Asp Arg Tyr Pro Ser Val lie Trp Glu Ala Gin 
85 90 95 

Cys Arg His Gin Arg Cys Val Asn Ala Glu Gly Lyc Leu Asp His His 
100 105 110 

Met Asn Ser Val Leu He Gin Gin Glu He Leu Val Leu Lys Arg Glu 
115 120 125 

Pro Glu Ser Cys Pro Phe Thr Phe Arg Val Glu Lys Met Leu Val Gly 
130 135 140 

Val Gly Cys Thr Cys Val Ala Ser He Val Arg Gin Ala Ala 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 amino acids 

(B) TYPE: amino acid 

{C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Thr Phe Arg Met Thr Ser Leu Val Leu Leu Leu Leu Leu Ser He 
1 5 10 15 

Asp Cys He Val Lys Ser Glu He Thr Ser Ala Gin Thr Pro Arg Cys 
20 25 30 

Leu Ala Ala Asn Asn Ser Phe Pro Arg Ser Val Met Val Thr Leu Ser 
35 40 45 

He Arg Asn Trp Asn Thr Ser Ser Lys Arg Ala Ser Asp Tyr Tyr Asn 
50 55 60 

Arg Ser Thr Ser Pro Trp Thr Leu His Arg Asn Glu Asp Gin Asp Arg 
65 70 75 80 

Tyr Pro Ser Val He Trp Glu Ala Lys Cys Arg Tyr Leu Gly Cys Val 
85 90 95 

Asn Ala Asp Gly Asn Val Asp Tyr His Met Asn Ser Val Pro He Gin 
100 105 110 

Gin Glu He Leu Val Val Arg Lys Gly His Gin Pro Cys Pro Asn Ser 
115 120 125 



Phe Arg Leu Glu Lys Met Leu Val Thr Val Gly Cys Thr Cys Val Thr 
130 135 140 
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Pro He Val His Asn Val Asp 
145 150 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D> TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Asp Trp Pro His Asn Leu Leu Phe Leu Leu Thr He Ser He Phe 
15 10 15 

l*u Gly Leu Gly Gin Pro Arg Ser Pro Lys Ser Lys Arg Lys Gly Gin 
20 25 30 

Gly Arg Pro Gly Pro Leu Ala Pro Gly Pro His Gin Val Pro Leu Asp 
35 40 45 

Leu Val Ser Arg Met Lys Pro Tyr Ala Arg Met Glu Glu Tyr Gin Arg 
50 55 60 

Asn He Glu Glu Met Val Ala Gin Leu Arg Asn Ser Ser Glu Leu Ala 
65 70 75 80 

Gin Arg Lys Cys Glu Val Asn Leu Gin Leu Trp Met Ser Asn Lys Arg 
85 90 95 

Ser Leu Ser Pro Trp Gly Tyr Ser lie Asn His Asp Pro Ser Arg He 
100 105 110 

Pro Val Asp Leu Pro Glu Ala Arg Cys Leu Cys Leu Gly Cys Val Asn 
115 120 125 

Pro Phe Thr Met Gin Glu Asp Arg Ser Met Val Ser Val Pro Val Phe 
130 135 140 

Ser Gin Val Pro Val Arg Arg Arg Leu Cys Pro Pro Pro Pro Arg Thr 
145 150 • 155 160 

Gly Pro Cys Arg Gin Arg Ala Val Met Glu Thr He Ala Val Gly Cys 
165 170 175 

Thr Cys He Phe 
180 

i2) INFORMATION FOR SEQ ID NO : 9 : 

U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GATCGCGGAT CCGACACGGA TGAGGACCGC TATCCACAGA AGCTG 
<2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 : 
CCCAAGCTTT CACACTGAAC GGGGCAGCAC GCAGGTGCAG C 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

( ii} MOLECULE TYPE; DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CGCCGCGGAT CCGCCATCCG CACGAGTGGA CACGG 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

. (ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CGCGGTACCC ACTGAAOGGG GCAGCACGC 
(2) INFORMATION FOR SEQ ID NO: 13: 
(i> SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 733 base pairs 

{B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GGGATCOGGA GCCCAAATCT TCTGACAAAA CTCACACATG CCCACCGTGC CCAGCACCTG 60 

AATTCGAGGG TGCACCGTCA GTCTTCCTCT TCCCCCCAAA ACCCAAGGAC ACCCTCATGA 120 

TCTCCCGGAC TCCTGAGGTC ACATGCGTGG TGGTGGACGT AAGCCAOGAA GACCCTGAGG 180 

TCAAGTTCAA CTGGTACGTG GACGGCGTGG AGGTGCATAA TGCCAAGACA AAGCCGCGGG 240 

AGGAGCAGTA CAACAGCACG TACCGTGTGG TCAGCGTCCT CACCGTCCTG CACCAGGACT 300 

GGCTGAATGG CAAGGAGTAC AAGTGCAAGG TCTCCAACAA AGCCCTCCCA ACCCCCATCG 360 



cn 

O 

D 

as 

SI AGAAAACCAT CTCCAAAGCC AAAGGGCAGC CCCGAGAACC ACAGGTCTAC ACCCTGCCCC 420 

W 

£ CATCCCGGGA TGAGCTGACC AAGAACCAGG TCAGCCTGAC CTGCCTGGTC AAAGGCTTCT 480 

b 



ATCCAAGCGA CATCGCCGTG GAGTGGGAGA GCAATGGGCA GCCGGAGAAC AACTACAAGA 540 
CCACGCCTCC CGTGCTGGAC TCCGACGGCT CCTTCTTCCT CTACAGCAAG CTCACCGTGG 600 



□ 
W 

fy ACAAGAGCAG GTGGCAGCAG GGGAACGTCT TCTCATGCTC CGTGATOCAT GAGGCTCTGC 660 

ACAACCACTA CACGCAGAAG AGCCTCTCCC TGTCTCCGGG TAAATGAGTG CGACGGCCGC 720 
^ GACTCTAGAG GAT 733 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi> SEQUENCE DESCRIPTION : SEQ ID NO: 14: 

Trp Ser Xaa Trp Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GCGCCTCGAG ATTTCCCCGA AATCTAGATT TCCCCGAAAT GATTTCCCCG AAATGATTTC 60 
CCCGAAATAT CTGCCATCTC AATTAG 86 
(2) INFORMATION FOR SEQ TO NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{D) TOPOLOGY: linear 

EH 

O (ii) MOLECULE TYPE: DNA (genomic) 

D 

03 
M 
U 

•P 

D 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 16 : 
GCGGCAAGCT TTTTGCAAAG CCTAGGC 
(2) INFORMATION FOR SEQ ID NO: 17: 



c 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CTCGAGATTT CCCCGAAATC TAGATTTCCC CGAAATGATT TCCCCGAAAT CATTTCCCCG 60 

AAATATCTGC CATCTCAATT AGTCAGCAAC CATAGTCCCG CCCCTAACTC CGCCCATCCC 120 

GCCCCTAACT CCGCCCAGTT CCGCCCATTC TCCGCCCCAT GGCTGACTAA TTTTTTTTAT 180 

TTATGCAGAG GCCGAGGCCG CCTCGGCCTC TGAGCTATTC CAGAAGTAGT GAGGAGGCTT 240 

TTTTGGAGGC CTAGGCTTTT GCAAAAAGCT T 271 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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< i± > MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GCGCTCGAGG GATGACAGCG ATAGAACCCC GG 
(2) INFORMATION FOR SEQ ID NO: 19: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

{Li) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GCGAAGCTTC GCGACTCCCC GGATCCGCCT C 
<2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 20: 
GGGGACTTTC CC 

(2) INFORMATION FOR SEQ ID NO: 21: 

(X) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GCGGCCTCGA GGGGACTTTC CCGGGGACTT TCCGGGGACT TTCCGGGACT TTCCATCCTG 
CCATCTCAAT TAG 

(2) INFORMATION FOR SEQ ID NO: 22: 
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(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLE CUT, E TYPE: DMA (genomic) 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO:22: 
GCGGCAAGCT TTTTGCAAAG CCTAGGC 27 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE aiARACTERISTICS : 

(A) LENGTH: 256 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
D (D) TOPOLOGY: linear 

D 

£J <ii) MOLECULE TYPE: DNA (genomic) 

W 

"P (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

P 

8 CTCGAGGGGA CTTTCCCGGG GACT1TCCGG GGACTTTCCG GGACTTTCCA 'L'CTGCCATCT 60 

C 

CAATTAGTCA GCAACCATAG TCCCGCCCCT AACTCCGCCC ATCCCGCCCC TAACTCCGCC 120 



vQ 

130 



CAGTTCCGCC CA1TCTCCGC CCCATGGCTG ACTAATTTTT TTTATTTATG CAGAGGCCGA 180 

GGCCGCCTCG GCCTCTGAGC TATTCCAGAA GTAGTGAGGA GGCTTTTTTG GAGGCCTTAGG 240 

CTTTTGCAAA AAGCTT 256 
(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 371 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOIOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

NAATTCGGCA NAGGGNGAAA CGACCCGGCA NGCGATNCCT GTGTGCCGCC CGCATGGAGG 60 

GTTTGGAAAA GTTCACGGNG GCTCCCTGAG GACCTGCGAG AATCGGGCTG CTGCGGGTCC 120 

AAGGCGTGGA CTCACCGCTG GGTGCTTGCC AAANAGGATA GGGACGCATA TGCTITTTAA 180 

AGCAATCl'AA AAATAATAAT AAGTATAGCG ACTATATACC TACTTTTAAA ATCAACTGTT 240 
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TTGAATAGAG GCAGAGCTTA TTTTATATTA TCCAAATGAG AGCTACTCTG TTNACATPiT 300 

CTTTAAACAT TTTAAACATN GOTTTTTTNA CTTCTTNCTG GGTNGGATTT TTTTTAAAGG 360 

CNTAATTGGG A 371 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS : 

tA) LENGTH: 498 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

m 

p AATTCGGCAG AGCCAGNCCG GAGAAGGACG CAC5ACAGCAT CAACTCCAGC ATCGACAAAC 60 

D 
OS 
S4 
W 

«P GGNTTCGGTC GGCGACTCTG AAGAGAC3TNC ACCGAGCAAA CCAAGTGCCG GAGCAACAGC 240 



□ 



AGGGCGCCAA GCTCCTGCTG GGCCCCAACG ACGCGCCCGC TGGCSCCTGA AGGNCGGTTC 120 
CTGCCCCGGG AGGTCTCCCC GGNCCCGCAT CCCGAGGCGC CCAAGC1X3GA GCCGCCTGGA 1.00 



GUCGNCTTTT NCATGGAGAT TCGTAAGCAN TTI1CATTTG ACANGGGGAT CCCTGGTTTG 300 
TTTTTAGTTA CAAGCAAGCA NNTGGNTTGA AGTNGNTGGG GAAAGGANCC CNAGGGATTC 360 
TGTNTTNGGG GCCNTNTGGA GGGTTTTGGA AAATTTNAGG GQGTTNCTGN OGGTTTTTTA 420 



Q 

m 

ru 

s3 ANATTGGNTT TTTTTAGGGT TNAAGGGTNN NTTAACTTGG GNGTTTTTCN AANNGTTGGG 480 

CO 

GGATTNTTTT TNAAGATT 498 
(2) INFORMATION FOR SEQ ID NO: 26: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) M0IJ3CULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CCTTCAGCAA GCCTCTTTTA ATGATTTAAA AAAGAACTTC TAACTTTTAC AAATAATTGT 60 

. GTTCAGAGAC AGACTTTCAA TCTAAAGAAA AGATCAAGGN CCTAGCTCTN GTGGCAGAAT 120 

GCAGAAACAG AAGCCNCCAG ATNGANCTCN GCAGATGCTA ACGNGGCCCA CTTTGTCC 3 78 
(2) INFORMATION FOR SEQ ID NO: 27: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 264 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

GGCAGAGCCA AGCTCCTGCT GGUGCCCCAA CGACGCGCCC GCTGGCCCCT AAGGCCGGTT 60 

CCTGCCCCGG AAGGTCTCCC CGGCCCGCAT CCCGAGGCGC CCAAGCTGGA GCCGCCTGGA 120 

CK3GCTTCGGT NOGGCGAACC TCTGAAAGAG AAGTGCCACC GAGCAAACCA AGTGCCGGTA 160 

GCACCAGNGC CGCCTTTCCA TQGAGACTCG TAAGCAGCTT CATCTGANAC GGGAATCCCT 240 

Q GGTTTGCTTT TAGCTACAAG CAAG 264 

D 
CO 
SJ 
W 
•P 
D 



S 
W 



PPI' 11 ! 11 
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What Is Claimed Is; 

1 . An isolated nucleic acid molecule comprising a polynucleotide having a 
nucleotide sequence at least 95% identical to a sequence selected from the group 
consisting of: 

(a) a polynucleotide fragment of SEQ ID NO: 1 or a polynucleotide 
fragment of the cDNA sequence included in ATCC Deposit No: 209666; 

(b) a polynucleotide encoding a polypeptide fragment of SEQ ID 
NO:2 or the cDNA sequence included in ATCC Deposit No: 209666; 

(c) a polynucleotide encoding conserved polypeptide domain I of 
SEQ ID NO:2 or the cDNA sequence included in ATCC Deposit No: 209666; 

|JJ . (d) a polynucleotide encoding conserved polypeptide domain II of 

p SEQ ID NO:2 or the cDNA sequence included in ATCC Deposit No: 209666; 

OS (e) a polynucleotide encoding conserved polypeptide domain m of 

y SEQ ID NO:2 or the cDNA sequence included in ATCC Deposit No: 209666; 

£ (f) a polynucleotide encoding conserved polypeptide domain IV of 

P SEQ ID NO:2 or the cDNA sequence included in ATCC Deposit No: 209666; 

p (g) a polynucleotide encoding a polypeptide epitope of SEQ ID 

W NO:2 or the cDNA sequence included in ATCC Deposit No: 209666; 

fD 

£ (h) a polynucleotide encoding a polypeptide of SEQ ID NO:2 or the 

tfi cDNA sequence included in ATCC Deposit No: 209666 having biological activity; 

(i) a polynucleotide which is a variant of SEQ ID NO: 1 ; 
(j) a polynucleotide which is an allelic variant of SEQ ID NO: 1 ; 
(k) a polynucleotide which encodes a species homologue of the 
polypeptide whose amino acid sequence is shown in SEQ ID NO:2; and 

(1) a polynucleotide capable of hybridizing under stringent 
conditions to any one of the polynucleotides specified in (a)-(k), wherein said 
polynucleotide does not hybridize under stringent conditions to a nucleic acid molecule 
having a nucleotide sequence of only A residues or of only T residues. 

2. An isolated nucleic acid molecule comprising a polynucleotide having a 
nucleotide sequence at least 95% identical to a sequence selected from the group 
consisting of: 

(a) a polynucleotide fragment of SEQ I D NO:3 or a polynucleotide 
fragment of the cDNA sequence included in ATCC Deposit No: 209665; 
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(b) a polynucleotide encoding a polypeptide fragment of SEQ ID 
NO:4 or the cDNA sequence included in ATCC Deposit No: 209665; 

(c) a polynucleotide encoding conserved polypeptide domain I of 
SEQ ID NO:4 or the cDNA sequence included in ATCC Deposit No: 209665; 

(d) a polynucleotide encoding conserved polypeptide domain U of 
SEQ ID NO:4 or the cDNA sequence included in ATCC Deposit No: 209665; 

(e) a polynucleotide encoding conserved polypeptide domain 111 of 
SEQ ID NO:4 or the cDNA sequence included in ATCC Deposit No: 209665; 

(f) a polynucleotide encoding conserved polypeptide domain IV of 
SEQ ID NO:4 or the cDNA sequence included in ATCC Deposit No: 209665; 

(g) a polynucleotide encoding a polypeptide epitope of SEQ ID 
NO:4 or the cDNA sequence included in ATCC Deposit No: 209665; 

(h) a polynucleotide encoding a polypeptide of SEQ ID NO:4 or the 
cDNA sequence included in ATCC Deposit No: 209665 having biological activity; 

(i) a polynucleotide which is a variant of SEQ ID NO:3; 

(j) a polynucleotide which is an allelic variant of SEQ ID NO:3; 

(k) a polynucleotide which encodes a species homologue of the 
polypeptide whose amino acid sequence is shown in SEQ ID NO:4; and 

(1) a polynucleotide capable of hybridizing under stringent 
conditions to any one of the polynucleotides specified in (a)-(k), wherein said 
polynucleotide does not hybridize under stringent conditions to a nucleic acid molecule 
having a nucleotide sequence of only A residues or of only T residues. 

3 . The isolated nucleic acid molecule of claim 1, wherein the 
polynucleotide fragment comprises a nucleotide sequence encoding a mature form or a 
secreted protein. 

4 . The isolated nucleic acid molecule of claim 2, wherein the 
polynucleotide fragment comprises a nucleotide sequence encoding a mature form or a 
secreted protein. 

5 . The isolated nucleic acid molecule of claim 1 , wherein the 
polynucleotide fragment comprises a nucleotide sequence encoding the sequence 
identified as SEQ ID NO:2 or the coding sequence included in ATCC Deposit No: 
209666. 
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6. The isolated nucleic acid molecule of claim 2, wherein the 
polynucleotide fragment comprises a nucleotide sequence encoding the sequence 
identified as SEQ ID NO:4 or the coding sequence included in ATCC Deposit No: 
209665. 

7 . The isolated nucleic acid molecule of claim 1 , wherein the 
polynucleotide fragment comprises the entire nucleotide sequence of SEQ ID NO:l or 
the cDNA sequence included in ATCC Deposit No: 209666. 

8 . The isolated nucleic acid molecule of claim 2, wherein the 
polynucleotide fragment comprises the entire nucleotide sequence of SEQ ID NO:3 or 
the cDN A sequence included in ATCC Deposit No: 209665. 

9. The isolated nucleic acid molecule of claims 3 or 4, wherein the 
nucleotide sequence comprises sequential nucleotide deletions from either the 
C-terminus or the N-terminus. 

10. The isolated nucleic acid molecule of claims 5 or 6, wherein the 
nucleotide sequence comprises sequential nucleotide deletions from either the 
C-terminus or the N-terminus. 

11. A recombinant vector comprising the isolated nucleic acid molecule of 
claims 1 or 2. 

12. A method of making a recombinant host cell comprising the isolated 
nucleic acid molecule of claim 1 . 

13. A method of making a recombinant host cell comprising the isolated 
nucleic acid molecule of claim 2. 

14. A recombinant host cell produced by the method of claim 12. 

15. A recombinant host cell produced by the method of claim 1 3. 

16. The recombinant host cell of claim 14 comprising vector sequences. 
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1 7 . The recombinant host cell of claim 1 5 comprising vector sequences. 

18. An isolated polypeptide comprising an amino acid sequence at least 95% 
identical to a sequence selected from the group consisting of: 

(a) a polypeptide fragment of SEQ ID NO:2 or the encoded 
sequence included in ATCC Deposit No: 209666; 

(b) a polypeptide fragment of SEQ ID NO:2 or the encoded 
sequence included in ATCC Deposit No: 209666 having biological activity; 

(c) a polypeptide domain of SEQ TD NO: 2 or the encoded sequence 
included.in ATCC Deposit No: 209666; 

(d) a polypeptide epitope of SEQ ID NO: 2 or the encoded sequence 
included in ATCC Deposit No: 209666; 

(e) a mature form of a secreted protein; 

(f) a full length secreted protein; 

(g) a variant of SEQ I D NO:2; 

(h) an allelic variant of SEQ ID NO:2; or 

(i) a species homologue of the SEQ ID NO:2. 

19. An isolated polypeptide comprising an amino acid sequence at least 95% 
identical to a sequence selected from the group consisting of: 

(a) a polypeptide fragment of SEQ TD NO:4 or the encoded 
sequence included in ATCC Deposit No: 209665; 

(b) a polypeptide fragment of SEQ ID NO:4 or the encoded 
sequence included in ATCC Deposit No: 209665 having biological activity; 

(c) a polypeptide domain of SEQ ID NO:4 or the encoded sequence 
included in ATCC Deposit No: 209665; 

(d) a polypeptide epitope of SEQ ID NO:4 or the encoded sequence 
included in ATCC Deposit No: 209665; 

(e) a mature form of a secreted protein; 
1 (0 a full length secreted protein; 

(g) a variant of SEQ ID NO:4; 

(h) an allelic variant of SEQ ID NO:4; or 

(i) a species homologue of the SEQ ID NO:4. 



20. The isolated polypeptide of claims 1 8 or 19, wherein the mature form or 
the full length secreted protein comprises sequential amino acid deletions from either the 
C-terminus or the N-terminus. 
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21. An isolated antibody that binds specifically to the isolated polypeptide of 
claims 18 or 19. 

22. A recombinant host cell that expresses the isolated polypeptide of claim 

18. 

23 . A recombinant host cell that expresses the isolated polypeptide of claim 

19. 

24. A method of making an isolated polypeptide comprising: 

(a) culturing the recombinant host cell of claim 22 under conditions 
such that said polypeptide is expressed; and 

(b) recovering said polypeptide. 

25. A method of making an isolated polypeptide comprising: 

(a) culturing the recombinant host cell of claim 23 under conditions 
such that said polypeptide is expressed; and 

(b) recovering said polypeptide. 

26. The polypeptide produced by claims 24 or 25. 

27 . A method for preventing, treating, or ameliorating a medical condition 
which comprises administering to a mammalian subject a therapeutically effective 
amount of the polypeptide of claims 18 or 19 or of the polynucleotide of claims 1 or 2. 

28. A method of diagnosing a pathological condition or a susceptibility to a 
pathological condition in a subject related to expression or activity of a secreted protein 
comprising: 

(a) determining the presence or absence of a mutation in the 
polynucleotide of claims 1 or 2; 

(b) diagnosing a pathological condition or a susceptibility to a 
pathological condition based on the presence or absence of said mutation. 

29. A method of diagnosing a pathological condition or a susceptibility to a 
pathological condition in a subject related to expression or activity of a secreted protein 
comprising: 
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(a) determining the presence or amount of expression of the 
polypeptide of claims 18 or 19 in a biological sample; 

(b) diagnosing a pathological condition or a susceptibility to a 
pathological condition based on the presence or amount of expression of the 
polypeptide. 

30. A method for identifying binding partner to the polypeptide of claims 18 
or 19 comprising: 

(a) contacting the polypeptide of claims 18 or 19 with a binding 

partner; and 

(b) determming whether the binding partner effects an activity of the 

polypeptide. 

31. The gene corresponding to the cDNA sequence of SEQ ID NO:2. 

32. The gene corresponding to the cDNA sequence of SEQ ID NO:4. 

33 . A method of identifying an activity in a biological assay, wherein the 
method comprises: 

(a) expressing SEQ ID NO:l in a cell; 

(b) isolating the supernatant; 

(c) detecting an activity in a biological assay; and 

(d) identifying the protein in the supernatant having the activity. 

34. A method of identifying an activity in a biological assay, wherein the 
method comprises: 

(a) expressing SEQ ID NO:3 in a cell; 

(b) isolating the supernatant; 

(c) detecting an activity in a biological assay; and 

(d) identifying the protein in the supernatant having the activity. 

3 5 . The product produced by the method of claim 33 . 



36. The product produced by the method of claim 34. 
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Abstract 

The present invention relates to novel human proteins designated Tnterleukin-21 
(IL-21) and Interleukin-22 (TL-22), and isolated polynucleotides encoding these 
proteins. Also provided are vectors, host cells, antibodies, and recombinant methods 
for producing these human proteins. The invention further relates to diagnostic and 
therapeutic methods useful for diagnosing and treating disorders related to these novel 
human proteins. 



Figure 1 
Inter leukin- 21 



1 GGC ACGAGKXSACACGGATGAGGACCGCTA^ 6 0 

1 A R ynTDEDRYP Q K L A F A E £__L_ 20 
Domain I Domain IT 

61 GTGCAT^GGCTGTATCGAT^ 120 

21 r p G C iDARTGRETAALNSVR 40 

Domain JI 

121 (OTGCTCCAGAGCCTX^^ 180 

41 L L Q S L t. v f. R R R P CSRDGSGL 60 
Domain IT I 

181 CCCCAC*CX7K3GGGttrmrc^ 240 

61 PTPGAPAFHTEFIH Y P V G C T- 8° 

Domain IV 

241 CKXTCTCCTGCCCCGTITC^^ 300 

81 C V L P R S V 87 
Domain TV 

301 GTGCTCCCC^GAGGGCACCCCCTATTT^ 360 

363 AACACTAC(XriTO3GGrcTGGGCATTC 420 

421 CTCATCTCCAGCCTCAC H'AGTTGCK^GGTVKJAAGGAGCTGAGC ACCfCTTCO AGCCCTT AA 480 

481 AGClXX^GAAAAGGTGTCACACGCXrKXXnOT 540 

541. TCCCrTACCCTATCflCTG^ 600 

601 ^ACCCCIKSTTTCITAAACAATTA^^ 6 60 



661 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA • 705 



Figure 2 
mt*rlmi)tin-22 



1 GGAATTCGGCACGAGCTCGTGCCGTC 60 

1 NSARARAVJ, S AFHHTLQLGP 20 

61 OXrGTCAGaVGGCGCGCAACGCGACCrGrc 120 

21 REQARNASC PAGGRPADRRF 40 

3 2 1 TCCGGCCGCCCACCAACCTGCGCAGCC^^ 180 

41 RPPTNLRSVSPWAYRI S Y D P 60 

Danviin I 

61 A R Y P R Y L P E A Y C I, C R G C L T G 80 
Domain I Domain II 

241 GGCTCTTCGGCGAGGAGGACGTGCGCTTCCGCAGCGCCC 300 

81 LFGEEDVRFRSAPVYMPT \L J£ 100 

Domain III 

301 TCCTGC\3CCGCACCCCCGCCT^ 360 

101 L R R T P ACAGGRSVYTEAYVT 120 
Domain III 

# . 

361 CCATC(XCGTK3GGCreCA^^ 420 

121 IPVGCTCV PEPEKDADSXN5 140 
Domain IV 



421 CCAGCATCGACAAACAGGGCGCCAAGCTCCTG^ 480 
141 SIDKQGAKLLLGPNDAPAGP 160 



481 CX^TGAGGCCGGTCCTGCCCCGGGAGGTCTCCCCGGCCCG^ 540 



541, GAGCCGCXnGGAGGGCTCGGTCGGCGACCTCT^ 600 



601 CCGGACXIACCAGCGCCGCCnTCCATGGACACTCCT 660 
661 TCCCTGGCTTGCTTTTAGCTACAAGCAAGCAGC^ 720 
721 CGGCACGGGCATCCTGTGTGCGGCOXX^TGGA 780 



781 CTGAGGAGCCTCTC AG ATCGGCTGCIX3CGGGTGCAGGGCGTGACTCACCGCTGGGTGC^ B 4 0 



841 GCCAAAGAGATAGGGACGCATATGCTTITTAAAGCAATCT 900 



Figure 2 (continued) 
Interleukin-22 

901 CXSACrATATACCTACTTTTAAAATCAACT 960 

961 ATCAAATGAGAGCTACTCTGTTACATTTCTTAACATA^ 1020 

1021 TCGTAGAATTTTTTAAAGCATAAITGGAAT^ 1060 

1081 TGGCCTGGGTCTCTGAATTCAGCCTGTCACC^ 1140 

1141 t^TCTX^CCCACTCTTCCTTC^ 1200 

1201 atgcacag<x:ggctcgcatgccccagggccagctaagag^ 1260 

„ ' 1261 TTTTAGTCATQAATACATAAACAfiTC^XZAAACTCGCACAAin 1320 

En 

o ...... 

D 1321 CC ACTGGCX5CC AATTTGTGGTT^ PCTTT 13B0 

05 

S] -.-..*• 

1383 GCXLAGTT^ITIAGAAGAATXrCAAGCAGGTATTG 1440 

•P ...... 

O 1441 CICAATATGAGGACAAAGTGGGCCACCTTAGC^ AGGCTTCT 1500 



□ 

\J\ 1501 GTTTCTGCATTCTGCCACGAGAGCTAGGTCCTTGATCTTVrCTTOT^TTGAAAffrCTGT 1560 

^ 1561 CTt:TCAACACMTTATTTCTAAAA^ 1620 

9 

1621 GCTGAAAAAAAAAAAAAAAAAA 1642 
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Figure 3 



Figure 4 

Intorleukin-21 Polypeptide Analysis 
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Figure 5 

Tnterleukin-22 Polypeptide Analysis 
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ON 04/24/01 THE ADDRESS OF RECORD FOR CUSTOMER NUMBER 22195 IS: 

HUMAN GENOME SCIENCES INC 
9410 KEY WEST AVENUE 
ROCKVILLE MD 20850 



AND THE PRACTITIONERS OF RECORD FOR CUSTOMER NUMBER 22195 ARE: 
40302 40582 41119 43710 43975 46789 



PTO INSTRUCTIONS: PLEASE TAKE THE FOLLOWING ACTION WHEN THE 
CORRESPONDENCE ADDRESS HAS BEEN CHANGED TO CUSTOMER NUMBER: 
RECORD, ON THE NEXT AVAILABLE CONTENTS LINE OF THE FILE JACKET . 
'ADDRESS CHANGE TO CUSTOMER NUMBER' . LINE THROUGH THE OLD 
ADDRESS ON THE FILE JACKET LABEL AND ENTER ONLY THE 'CUSTOMER 
NUMBER' AS THE NEW ADDRESS. FILE THIS LETTER IN THE FILE JACKET, 
WHEN ABOVE CHANGES ARE ONLY TO FEE ADDRESS AND/OR PRACTITIONERS 
OF RECORD, FILE LETTER IN THE FILE JACKET. 
THIS FILE IS ASSIGNED TO GAU 0000. 



PTO-FMD 
TALBOT- 1/97 



-.-V. * 



1 



